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Preface 



Modern astronomy has stretched its domains of exploration tremendously. 
Not only objects at very large distances and very old states of the Universe 
can be examined, but also all kinds of radiations and phenomena are now 
accessible. Astronomers constantly move from considerations about very di- 
luted to very dense systems. 

Hot and energetic systems, being the easiest to observe, have attracted a 
lot of attention. However the cold and low energetic states have been some- 
what neglected, either because being harder to observe they appear unexcit- 
ing, or because being less well known they tend to be ignored. However the 
Universe background radiation has now been determined as the most perfect 
known case of a black-body spectrum, a substantial fraction of matter spends 
some time close to the temperature of this universal thermal bath, before be- 
ing transformed into stars or planets. Some objects, such as rapidly expanding 
gas shells in planetary nebulae, may even succeed in reaching a temperature 
well below the background radiation temperature through the mere action 
of adiabatic expansion. In view of the highly dynamical and turbulent state 
of the interstellar medium, hot and cold temperature fluctuations must be 
expected, while the clear observational bias is to observe the hot rather than 
the cold fluctuations. Fortunately with the accessibility of far-infrared and 
sub-millimetric instruments such as SCUBA, WMAP, Planck or ALMA, we 
can expect in the coming years continuous advances in our understanding of 
these harder to observe cold stages of matter. 

For these reasons the members of the Swiss Society for Astrophysics and 
Astronomy voted in favor of the organisation of a winter Saas-Fee course on 
the Cold Universe. The aim of this course was to cover as well as possible the 
current knowledge in observational and theoretical astrophysics dealing with 
the coldest objects in the Universe. The selected lecturers, Andrew Blain, 
Frangoise Combes and Bruce Draine have succeeded in bringing to the 70 
participants a very rich and interesting amount of information, that the reader 
can enjoy now in the following pages. We are very grateful to the lecturers 
for their invaluable live lectures as well for their presently available written 



versions. 




VI 



Preface 



We thank also the course Secretary, Ms Irene Scheffre, and our colleagues 
Paul Bartholdi, Veruska Muccione, Yves Revaz, and Nunos Santos, for all 
their help in the practical organisation of the course. 

As as reminder the course took place in the picturesque village of Gri- 
mentz in the south Alps of Switzerland. As a practical introduction to the 
Cold Universe, the participants could taste ice-cream made on the spot with a 
mixture of rasberries, strawberries, liquid cream and liquid nitrogen! The pic- 
ture shows from left to right Yves Revaz, Paul Bartholdi, Veruska Muccione 
and Frangoise Combes enjoying the cooling process. 




Geneva, July 2003 Daniel Pfenniger 

Arnold Benz 
Thierry Courvoisier 
(The Course Organisers) 
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Galaxy Formation and Evolution 
in the Cold Universe 



Andrew W. Blain 

California Institute of Technology, Astronomy 105-24, Pasadena, CA 91125, USA 



1 Introduction 

This part of the proceedings discusses our knowledge of the way that galaxies 
formed, and the information that we can gain about the process by observing 
the radiation from the cold components of the Universe. The physical pro- 
cesses involving the molecular and atomic gas and solid-phase components 
of the interstellar medium (ISM) in galaxies are discussed in great detail in 
the other contributions. Here we will concentrate on the continuum emission 
processes that are relevant to studies of distant galaxies, mentioning briefly 
the importance of molecular studies. Little new ground will be broken in de- 
scribing the processes at work as galaxies form. This is an extremely large 
and active research area, and one that has been described comprehensively 
elsewhere, including at this school in past years, sometimes touching upon 
the cold Universe theme [183]. Some current issues in the study of galaxy 
formation at long wavelengths will be raised, and the detailed physics and 
chemistry being discussed alongside [84,103] will be put into the context of 
our rapidly developing observationally driven understanding of the way in 
which galaxies formed. In addition, the discussion will be focused on how 
observations of the thermal emission from the constituents of the Universe at 
temperatures from a few lO’s to a few lOO’s of degrees are moving into the 
mainstream of observational cosmology, and the illumination of some of the 
directions that these studies will take in the years ahead. 

There are two main ways to probe the distant cold Universe. First, ob- 
serving the radiation from galaxies at mid-infrared (mid-IR) and longer wave- 
lengths (A > 10 /im) that is emitted by atomic and molecular gas and grains 
of dust at temperatures of order 10 — 1000 K. Line emission from molecular 
and atomic gas plays an important role in cooling dense gas clouds in the ISM, 
although little observational information is available about these phenomena 
early in the history of the Universe. In terms of bolometric luminosity, thermal 
emission from dust is certainly the dominant process at far-IR wavelengths 
out to moderate and high redshifts, z ~ 1—2 and z > 2—3 respectively. Unless 
extreme redshifts are being investigated, the emission from stars makes only 
a very small contribution to the signal detected at wavelengths longer than 
10 /im. Non-thermal synchrotron emission from high-energy electrons is also 
unlikely to be significant, apart from in very rare powerful radio-loud active 
galactic nuclei (AGN). Dust grains, the solid-phase of the ISM, and atomic 
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and molecular ISM gas all radiate at wavelengths between a few microns and 
several millimetres. Grains are heated radiatively by optical and UV radiation 
from young stars and AGN. They could also be excited by mechanical en- 
ergy deposited by shocks generated by interactions between galaxies, clouds 
within galaxies and supernova, but this is likely to provide only a very small 
fractional correction to the luminosity inferred from radiative heating. 

Second, observing the effects of absorption by neutral and low-excitation 
gas along the line of sight to distant luminous objects. While this is pre- 
dominantly a technique for optical observers, absorption line measurements 
of molecules backlit by intense radio sources have been made, revealing very 
detailed information about the properties of the ISM in galaxies at moderate 
redshift [268]. As the simultaneous spectral coverage of submillimetre- wave 
instruments increases and their sensitivity steadily improves, absorption line 
studies offer to become an important part of the armoury for studying the 
cool parts of the Universe. 

Additional probes can be provided by exploiting features that are visible 
against the Gosmic Microwave Background (GMB) radiation at millimetre 
wavelengths by scattering along the line of sight. Fluctuations in the GMB, 
both spatial and spectral, typically at the 10“® — >■ 10“^ level, are critical 
probes of cosmology. Spatial anisotropies in the GMB emission encode direct 
evidence about the galaxy formation process. Primordial features in the spec- 
trum of the GMB trace activity in the early Universe, while foreground ob- 
jects, which potentially include the very earliest galaxies and clusters of galax- 
ies, can impose a variety of detectable characteristic ‘secondary’ anisotropies 
to the GMB signal. Both primary and secondary features of the GMB are 
already being detected. The most prominent is the Sunyaev-Zeldovich (SZ) 
effect [255,70], produced by electrons at X-ray-emitting temperatures, of or- 
der 10®K/10keV, which has been clearly detected along the line of sight to 
clusters of galaxies, and may develop into an important probe of less massive 
structures at high redshifts. The intensity of the GMB increases with redshift 
0 as {1 -\- z)'^, exactly counteracting the reduction in surface brightness with 
increasing redshift that affects all cosmological observations. 

Direct observations of the submillimetre and far-IR emission from high- 
redshift galaxies are amongst the most recent techniques to be applied to the 
study of galaxy formation, beginning in 1997 [57]. This is largely because the 
technology to detect distant galaxies has matured later at submillimetre to 
mid-IR wavelengths than at radio and optical wavelengths [215]. The results 
discussed are generally all less than 5 years old, and so most of the relevant 
information was gathered since the extremely deep optical Hubble Deep Field 
(HDF) image was taken in 1995 [269]. Hopefully, the descriptions presented 
here will remain valid for many years to come; however, this is a young, very 
active field, and some of the results may be overturned when more data, secure 
multiwavelength galaxy identifications, and more time to contemplate the re- 
sults are available. Observational capabilities continue to develop rapidly, in 
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almost all the windows into the distant Universe. Some of the most relevant 
developments outside of the far-IR/submillimetre sphere are at radio, near- 
IR, optical and X-ray wavelengths. At radio wavelengths, the Very Large 
Array (VLA) will soon be upgraded to provide significantly improved sensi- 
tivity and resolution, and drastically improved spectroscopic capabilities. The 
UK-based MERLIN radio interferometer is also being upgraded dramatically, 
and the Indian Giant Metrewave Radio Telescope (GMRT) is being commis- 
sioned with a very large collecting area at the longest radio wavelengths. 
The number of 8-m-class optical/near-IR telescopes in service is increasing 
steadily, and full suites of instruments are being delivered for them. Wide- 
held multi-object spectroscopic capability at the Keck, Gemini, Subaru and 
VLT telescopes are on the verge of generating a new level of detail in deep 
galaxy redshift surveys, while the shallower, wide-area 2-degree-held (2dF) 
redshift survey is complete with about 230,000 redshifts [83], and the Sloan 
Digital Sky Survey (SDSS) [58] is well underway, generating galaxy redshifts 
at the rate of over 10^yr“^. The Chandra X-ray observatory has provided 
the hrst arcsec-resolution images of the very opposite of the cold Universe, 
and together with the greater collecting area, but coarser resolution provided 
by the XMM-Newton telescope, both oher excellent spectroscopic imaging of 
the high-energy Universe. 

All of these facilities will have an impact on our understanding of the dis- 
tant Universe and the genesis of galaxies. The importance of observing over 
as wide a range of wavelengths as possible, and the ways in which existing 
observations and theories can be extended and improved by adding informa- 
tion from the cooler parts of galaxies, will be emphasized. In the final section 
we will report on developments in instrumentation that should lead to a clear 
high-resolution view of the formation of galaxies in the years ahead. 

The other parts of this book describe the details of the radiation pro- 
cesses that take place in the ISM. When studying galaxy evolution at long 
wavelengths, it is generally difficult to make very detailed observations, both 
because of the restrictions of spatial resolution, and due to there being only 
a few flux density measurements available from an entire distant galaxy. It is 
therefore important to remember that extragalactic astronomy at thermal IR 
to millimetre wavelengths lacks the precision of studies of molecular clouds 
in the Milky Way and nearby galaxies. 

A practical long-wavelength limit to the spectral window in which galaxy 
evolution can be probed by observing the ISM is imposed by the GMB, which 
dominates the energy density in radiation in the Universe at the present epoch 
by about two orders of magnitude. As the total energy output of galaxies 
appears to be declining steadily with cosmic time at present, and it is likely 
that even as the Universe expands further and redshifts away the GMB energy 
density, GMB photons will not only remain the most numerous photons in 
the Universe but will also always dominate the energy density in the diffuse 
intergalactic cosmic radiation field, away from individual galaxies. 




4 



Andrew W. Blain 



At the present epoch, the CMB radiation spectrum peaks at a wavelength 
of 2 — 3 mm. The CMB is the most accurate blackbody known. Averaged 
over the whole sky, its temperature Tq = 2.726 ± 0.001 (at a 95% confidence 
limit)[190], and increases with redshift z as To(l + z). This temperature sets 
a lower limit to the temperature of any constituents of the Universe that 
interact strongly with radiation, such as electrons, dust grains and polar 
molecules. Because only Universal contents with this property are likely to 
radiate efficiently, then sources of thermal radiation at wavelengths longer 
than about 3 mm need not be considered. 

In the context of galaxies and their constituent stars and gas clouds, the 
CMB is usually a relatively insignificant source of heat. The temperature 
associated with the interstellar radiation field (ISRF) in galaxies and the 
temperatures equivalent to the velocities associated with outflows, winds and 
galactic dynamics are usually much higher. For example, the Milky Way has 
a rotation velocity of order 220kms“^, and the dispersion in velocity across 
a molecular cloud is typically 10 — 20 km s“^. The typical temperature of dust 
grains throughout the ISM is only 15 — 20K, still much greater than the CMB 
temperature. However, given that the volume of space occupied by galaxies 
is much less than 10“®, and the CMB is all pervading, the energy density 
in the CMB is large, corresponding to 4.0 x 10“^^ kgm“^, as compared with 
the 1.88 X kg m“^ in the form of regular matter. Since the density 

parameter — 0.3 and Hubble’s constant h ~ 0.65 (see later), the CMB 
contains 0.017% of the energy density in matter in the Universe at the present 
epoch. 

This discussion will concentrate on thermal emission at wavelengths be- 
tween the mid-IR and millimetre wavebands. The mid-IR spans wavelengths 
from about 5 — 40 /xm, starting at wavelengths where stellar emission no 
longer dominates the energy output of galaxies; the far-IR waveband covers 
from about 40 — 200 fxm; and the submillimetre and millimetre wavebands 
span from about 200 — 1000 /xm and longer than 1000 ^m respectively. This 
wavelength range is bounded at the long- wavelength side by the CMB. Obser- 
vational capabilities throughout this spectral region are developing rapidly. 

1.1 The Difficulties of Observing Faint and Distant Cold Objects 

In comparison with the oldest branches of observational cosmology that probe 
the distant Universe by detecting galaxies at optical and radio wavelengths, 
studies at infrared wavelengths were slow to develop. This reflects the need to 
overcome several physical challenges to imaging faint objects between infrared 
and submillimetre wavelengths. 

The first challenge is one of resolution. We are accustomed to seeing spec- 
tacular, high-resolution, multicolour views of the most distant galaxies ob- 
tained by the Hubble Space Telescope (HST), including the Northern and 
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Southern HDFs (HDF-N & HDF-S[270]).^ At longer wavelengths, the resolu- 
tion of images from a telescope of fixed aperture D falls steadily, and images 
cease to be arresting as soon as the ability to resolve the sub-arcsecond scale 
structures within galaxies at moderate redshifts is lost. In the absence of 
seeing this occurs at wavelengths longer than about 4[Z?/m]“^ /rm. At near- 
IR wavelengths diffraction-limited images retain excellent spatial information 
content; however, at wavelengths longer than 10 /xm, the aperture required 
to resolve distant galaxies grows towards 10 m. At still longer wavelengths, 
arcsec resolution cannot be achieved without the use of interferometry with 
baselines in the 100-1000 m range. The largest apertures flown in space to 
date are just 0.6m aboard both the Infrared Astronomy Satellite (IRAS) in 
1984 and the Infrared Space Observatory (ISO) in 1995. The Space Infra-Red 
Telescope Facility (SIRTF), scheduled to fly early in April 2003, has a 0.85 m 
aperture. The largest planned space aperture at long wavelengths is cur- 
rently the 3.5 m Herschel Space Observatory (formerly FIRST) ESA/NASA 
far-IR/submillimetre mission scheduled for launch in 2008, and the proposed 
Japanese 4-m class cold-aperture SPICA^ mission. 10 — 20m-class single- 
aperture mid- and far-IR space telescopes, such as SAFIR, and interferome- 
ters with longer baselines [191] have been proposed. 

If the resolution is coarsened further, to levels insufficient to separate ad- 
jacent galaxies on the sky, then the ultimate depth of observations is limited 
not by the ability to integrate longer and suppress photon noise, but by the 
unknown distribution of faint unresolved sources on the sky that cannot be 
detected individually. This problem of ‘confusion noise’ dogged early radio 
and X-ray observations. It is also a significant potential problem for obser- 
vations using SIRTF, especially for the deepest surveys to probe the high- 
redshift Universe. As detectors become more sensitive, confusion registers as 
an ever more pressing problem: if the relevant depth cannot be reached, due 
to noise in the instrument, then confusion is not an issue! At submillimetre 
wavelengths from the ground, it currently takes many lO’s of hours of inte- 
gration to reach the confusion limit. With a cooled telescope in space it could 
potentially take only seconds. At the longest wavelength probed by SIRTF 
-160 /im - confusion noise is likely to dominate other sources of noise in 
only about 10 s; although the time required will be much greater at shorter 
wavelengths. 

A second challenge is a physical one in the detectors themselves. There 
are two major relevant regimes. At the longest wavelengths, it is natural to 
make ‘heterodyne’ electronic detections of the electric held of radiation as it 
interferes with a very stable reference ‘local oscillator’ (LO) signal in a non- 
linear detecting element of comparable size to the wavelength of detected 

^ The Advanced Camera for Surveys (ACS) has just been installed on HST to 
provide even more sensitive and larger images than those from the existing Wide 
Field and Planetary Camera (WFPC-II). 

WWW. isas .ac.jp 
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radiation, or in a smaller detector coupled by a waveguide. This produces a 
beat ‘intermediate frequency’ (IF), whose processing dramatically reduces the 
frequency at which the receiver electronics are required to operate. The full 
spectral information is recovered by auto-correlating the IF output in time. 
This works extremely well at frequencies less than of order several hundred 
GHz. Efficient detection becomes more and more difficult as the frequency 
increases: it becomes challenging to manufacture the millimetre-scale com- 
ponents necessarily, to simulate the electromagnetic fields in the detector, 
and to generate precise LO signals at the relevant frequencies. Cryogenic 
heterodyne receivers fitted to radio, millimetre and submillimetre telescopes 
currently perform at close to the theoretical quantum efficiency limit set by 
the photon energy at these frequencies. Heterodyne devices at frequencies 
greater than 1 THz have been tested, but it is still very challenging to design 
and manufacture low-noise receivers for submillimetre wavelengths. 

At the shortest wavelengths it is more natural to detect the energy de- 
posited by a photon in the detector. This typically requires the promotion of 
electrons above a band gap in some semiconductor device, or the detection 
of the temperature change caused by deposited energy using a bolometer. At 
optical wavelengths this is a mature technology: CCD devices operate with 
quantum efficiencies over 90%. At longer wavelengths the technology is more 
challenging. It is essential to cool the detectors so that the bandgap remains 
much larger than the typical thermal energy of the electrons in the device. At 
wavelengths longer than about 10 /im this requires cryogenic temperatures, 
and very low cryogenic temperatures for wavelengths longer than 200 /im. 
Self-stabilizing superconducting sensors operating at their transition temper- 
ature due to thermo-electric feedback, and other innovative detection devices, 
offer dramatic future gains for bolometer sensitivities at submillimetre wave- 
lengths. The advantage of stability is that the bolometer current/ voltage that 
measures the intensity of illuminating radiation does not need to be read out 
constantly, but just twice per thermal response time, thus reducing the num- 
ber of wiring connections required. This both limits the conduction of heat 
into the detectors and simplify their manufacture. Some novel devices are 
read out using microwave radiation, allowing their designers to exploit ad- 
vanced yet inexpensive mobile telecoms technology to address the detectors, 
thus eliminating wiring heat load entirely. 

The reasons for the difficulties of detecting 10 — 1000 /xm sources, and why 
the first deep surveys at these wavelengths were made only in the late 1990’s 
is now hopefully clear. 

A third challenge is the effects of the atmosphere and optics. Putting a 
large cold aperture in space has not so far been practical, either financially or 
technically, and so observations have been carried out from small space tele- 
scopes or from the ground. Thermal radiation from the atmosphere and the 
telescope optics mask mid-IR signals. Molecules in the atmosphere, especially 
O 2 , OH and H 2 O, both emit and absorb powerfully, ruling out observations 
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in specific bands. With the exception of wavelengths shortward of the far- 
ultraviolet(UV) atmospheric cutoff, all the opaque parts of the atmosphere 
are found between the submillimetre and near-IR wavebands. 

These factors - technical difficulty of reaching sensitive detection limits, 
absorption and masking emission by the atmosphere and limited resolution 
- have all conspired to keep the submillimetre and infrared windows closed 
to sensitive observations long after the optical and radio windows had been 
substantially investigated. There is much to learn from this previously hidden 
window. 



2 Radiation from the Cool ISM in Galaxies 
and Its Detection 

2.1 Galaxy Structure 

The ISM of galaxies includes a variety of phases in a very complex interlinked 
geometry. The stellar populations of large low-redshift galaxies can usually 
be divided easily into an old, evolved, passive, pressure-supported spheroidal 
component at the centre, with a scale height of up to a few kiloparcsecs 
(kpc)^; and a more extended, rotationally-supported disk component with a 
scale length of typically several kpc, that contains stars with a much broader 
age distribution, and a much greater proportion of gas. In elliptical galax- 
ies the spheroid is dominant; whereas spiral galaxies are dominated by the 
disk component. Spiral and elliptical galaxies dominate optical atlases of the 
nearby Universe. However, these relatively serene and static objects are not 
the galaxies most likely to be found when investigating the evolution and 
formation of galaxies [25]. They are the almost fossilized end-points of the 
galaxy formation process, and so while any model or theory must explain 
their existence, it is the more dramatically active and luminous objects that 
are most important at infrared wavelengths. 

Irregular galaxies do not fit into the elliptical/spiral description. They are 
relatively rare in the local Universe and were relegated by Hubble to an extra 
box, disconnected from the main tuning fork of his galaxy morphology dia- 
gram. Their disorganized structure is taken to be evidence of either a recent 
strong tidal disturbance, or the debris remaining from a merger or interaction 
with a neighbour. By stirring up the ISM, these processes could easily lead to 
an enhanced rate of star formation: irregular galaxies are typically blue, and 
contain some very young stars. Interacting galaxies, for example the Anten- 
nae [196,271] have very disturbed ISMs. The disks of the two spiral galaxies 
involved in this interaction are deeply entwined. A large density of gas has 
been funneled into the region between the galactic nuclei, and a minority of 
the mass of the disks has been flung out in long (100 kpc), thin tails of tidal 
debris, extending far beyond the extent of the disks prior to the interaction. 
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When observed at infrared and submillimetre wavelengths, irregular and 
interacting galaxies often show the most powerful emission [198]. They are 
the objects that should come to mind immediately when powerful infrared 
galaxies are discussed. 

In addition, a small fraction of low-redshift galaxies contain AGN, with 
very bright, point-like nucleus that is naturally interpreted as the accretion 
disk surrounding a massive blackhole. AGN detected at optical wavelengths 
in the form of extremely bright, blue/UV quasars/QSOs, and at radio wave- 
lengths as radio galaxies with extended double lobes of synchrotron emis- 
sion, were the first high-redshift objects to be discovered and identified. The 
detected increase in their space density with increasing redshift to 0 ~ 2, 
combined with the eventual discovery of a convincing decline in their space 
density at still greater redshifts [236,104] provided the first evidence for strong 
evolution of galaxy populations over the Universe’s history. 

In some cases, AGN could result from intense, central clusters of OB stars 
in the nuclei of galaxies, while strong star-formation activity is often likely to 
be associated with the accumulation of gas that goes on to feed a blackhole. 
This feeding process is likely to be most efficient during galaxy interactions, 
as gas is driven inwards by tidal forces, and the orbits of gas and stars will be 
disrupted, refilling orbits that cross the accretion disk of the blackhole. Peri- 
odic blackhole accretion/AGN activity matched to enhanced star-formation 
activity during galaxy interactions seems likely to be an important part of 
the story of galaxy evolution, and one that is likely to excite emission from 
the ISM in regions heavily enshrouded by dust, and thus easiest to study at 
long wavelengths. 

2.2 AGN Accretion and Star Formation 

The question of the relative roles of heating by AGN and young, high-mass 
stars is an important one, as the most luminous inferred galaxies could be 
powered in almost any proportion by accretion onto an AGN or the forma- 
tion of massive stars. The efficiency of conversion of mass to energy in star 
formation is limited to the ~ 0.7% of rest-mass energy that can be released 
by fusion between hydrogen and iron, but for accretion to the radius of the 
last stable orbit of a supermassive blackhole the efficiency could reach about 
30%, and so much less mass is required to generate luminosity by fueling an 
AGN than by forming stars. 

The recent discovery of an almost linear correlation between the mass or 
velocity dispersion of the evolved stellar bulge component in galaxies and the 
inferred mass of the central blackhole [188,120,135] can inform this issue. A 
few parts per thousand of the mass of the most homogeneous, oldest and 
tightly-bound stellar populations in most galaxies is in the form of a black- 
hole. This is certainly true of the Milky Way, in which a 2.9 x 10® Mq compact 
object lurks at the Galactic Gentre within the ~ 10® Mq bulge [137,111]. 
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The processes setting these stellar and blackhole masses would seem to 
be connected, with a clear role for feedback between accretion and star- 
formation in the centres of galaxies [239]. Accretion could perhaps regulate 
star formation in the bulge. With an abundant fuel supply, a blackhole can 
accrete at the Eddington rate, gaining mass exponentially with radiation 
pressure on electrons balancing gravitational forces on protons. Direct or 
scattered radiation from the AGN extends its influence further and further 
into the galaxy, far into low-ionization regions of the bulge, producing an 
increasingly powerful, radiation-pressure driven outflow, clearing out gas from 
the central regions of the galaxy. Beyond a threshold strength depending 
on the size and mass of the host galaxy, this would prevent both further 
star-formation and limit accretion as fueling gas is expelled. An alternative 
picture in which an efficient burst of star formation in the bulge consumes 
all the available gas, terminating accretion on the same timescale as the 
star-formation is also possible [12]. Seed blackholes could form by the direct 
gravitational collapse of primordial overdense regions [145], from core collapse 
in an early generation of supermassive stars, or from the collapse of dense 
star clusters [110]. Seed blackholes could gradually accrete baryonic and dark 
matter from the surrounding halo inefficiently [186], invisibly generating the 
observed correlation, if the properties of the dark matter halo are tuned 
correctly. Independent star-formation activity would take place both earlier 
and later, to form a surrounding bulge. To arrive at the observed correlation 
by the present epoch, would require all relaxed stellar bulges to have similar 
velocity and density structure, and for the process of blackhole accretion 
having proceeded for a similar time in all cases, which does not seem very 
likely. The limited scatter in the blackhole-bulge ‘Magorrian relation’ implies 
that the bulge and blackhole formation processes are linked. 

This general discussion of the structure of galaxies and their possible 
formation histories is intended to illustrate the importance of their infrared 
emission for studying galaxy formation. However, only the closest galaxies 
can be resolved at these wavelengths, and only the disk components and 
nuclei of galaxies are detectable: ongoing star formation or AGN activity is 
essential for heating dust. Furthermore, there is no way to resolve dispersed 
star-formation activity from a much smaller AGN (not that this is necessarily 
easy in the case of exquisite angular resolution [262]). 

2.3 The Structure of Infrared Emission 

Without angular resolution, infrared observations provide a single measure- 
ment of the integrated energy emitted by all parts of a galaxy. This is in 
marked contrast to the resolved observations possible for star-forming clouds 
in the Milky Way. Hopefully, this situation will improve, with 0.1 — arcsec 
resolution becoming available with the commissioning of the SubMillimeter 
Array (SMA) [148] on Mauna Kea, with a 500 — m maximum baseline and 
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the Atacama Large Millimeter Array (ALMA) [272] with a maximum base- 
line longer than 10 km. Limits to the angular size of very luminous infrared 
galaxies at millimetre wavelengths are available, from the existing OVRO 
Millimeter Array (MMA)[157j and IRAM Plateau de Bure (PdBI) interfer- 
ometers [225,100]. 

However, spatially resolved submillimetre images are not currently avail- 
able, and none are foreseeable at far-IR wavelengths, prior to a space-borne 
telescope with an aperture or baseline larger than about 10 m. As a result, 
it is important to be aware that all sources of infrared emission from distant 
galaxies are superimposed, and can be discriminated only in the spectral 
domain. 

To investigate the spectral energy distribution (SEDs) of infrared-lumin- 
ous galaxies at optical or infrared wavelengths, it is not presently either pos- 
sible to constrain adequately or necessary to use radiative transfer techniques 
to determine the dust distribution in distant galaxies, due to the lack of de- 
tailed information about its spatial structure. Simplifying assumptions can 
be made when considering radiative transfer at longer wavelengths, how- 
ever. The main one is that, once emitted, a far-IR photon is likely to escape 
from the galaxy unhindered, without scattering or absorption. The opacity 
of dusty regions of galaxies can be extremely high at the optical and UV 
wavelengths where the sources heating the dust are most luminous, and the 
absorption efficiency of dust is greatest: there are up to 20 magnitudes of 
P-band extinction to the Galactic Centre, and at least 50 magnitudes to 
the centres of the most heavily-enshrouded AGN. However, even at 2.2 /rm 
the opacity has reduced enough for the most luminous stars in the Galactic 
Center can be discerned [111,137]. The optical depth at wavelengths of order 
20 /im is expected to be relatively small, while at wavelengths longer than 
of order 100 /rm the galaxy the opacity of the overlying ISM is less, by of 
order 100, and so general sightlines through galaxies are optically thin or 
only marginally optically thick. Only the very densest clouds are likely to re- 
main optically thick at longer wavelengths. Hence, illuminating star-forming 
regions and AGN can all be considered as point sources, heating dust locally, 
and contributing to the general ISRF that heats dust on larger galaxy-wide 
scales. At longer wavelengths, radiation escapes from the galaxy without fur- 
ther interactions, and so this part of the radiative transfer leading to the 
far-IR SED is easy to deal with. 

Existing radiative transfer studies have generally attempted to discrimi- 
nate between point-like AGN and star-formation regions distributed through- 
out the galaxy [141], or to predict expected SEDs for star-forming galaxies 
from semi-analytic galaxy formation models [96]. Note, however, that these 
models are restricted to very constrained geometries: cylindrical or spherical 
distributions, and sources embedded within disks. It is unlikely that the most 
luminous and interesting galaxies will conform with these simple geometries, 
and so care is still required when interpreting the SEDs of these objects. 
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Predicting the SED of a dusty galaxy at long wavelengths is much more 
straightforward than at optical wavelengths, where scattering and attenua- 
tion occurs throughout the ISM as a photon makes its way out of a galaxy. 
In contrast, the reprocessed SED of each dust cloud as detected on Earth 
at long wavelengths should be very similar to the spectrum emitted by the 
cloud within the galaxy. The difficult step is treating the processing of optical 
and UV photons within the probably turbulent and inhomogeneous regions 
around AGN and young stars, and the neighbouring dusty molecular clouds. 
For AGN, scattering and absorption from clouds at considerable distances 
from the nucleus could be important - especially in a merging system, if the 
intense escaping emission from the AGN in one galaxy shines on the disk 
of its neighbour. If predicting the far-IR properties of a dusty galaxy is rela- 
tively easy, apart from the issue of geometry, predicting its optical properties, 
and the fraction of energy that will be absorbed by dust is effectively impos- 
sible. This may change when data with comparable resolution from ALMA 
at submillimetre wavelengths and the James Wehh (Next Generation) Space 
Telescope (JWST) in the near-IR are available for large numbers of galaxies. 

The far-IR emission produced by a point-like AGN/nuclear star-forming 
region, and hot stars distributed more widely through the ISM can be de- 
scribed by a distribution of dust temperatures and relative luminosities at 
each temperature. Almost all the hot dust within the galaxy will be con- 
tained in these regions, but their intense radiation field and strong outflows 
are likely to disperse the dust on timescale comparable to the lifetime of a 
high-mass star. Hence the youngest star-formation regions are likely to start 
out invisible at optical and UV wavelengths, and to be detectable only in 
the infrared. Understanding the optical/UV appearance of the galaxy will 
be further complicated by this time-dependent appearance of different dust- 
enshrouded star-forming regions. The optical luminosity of a dusty galaxy is 
thus likely to increase at later times as the dust is dispersed [160]. The ISM 
that fills the space between high-luminosity regions will be illuminated by the 
general ISRF of the galaxy, contributed both by scattered and escaping op- 
tical/UV radiation from the densest luminous regions and from longer-lived 
low-mass stars throughout the galaxy, and the GMB. 

2.4 Dust Emission at the Coarse Level Relevant to Cosmology 

The complex details of heating of the solid-phase ISM are explained in much 
more detail elsewhere [103]. Here we discuss the issues that are most relevant 
to studying and understanding unresolved infrared galaxies. This is not to 
deny that a deeper understanding of the physics and chemistry of the ISM 
is essential; rather it expresses the difficulty of constraining these details 
from existing observations. It is important to retain a healthy scepticism 
about reports that a certain galaxy contains dust grains with a precise dust 
temperature or temperature distribution, or that any specified dust mass in 
a galaxy is quoted with an error less than 200%. 
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On a grain-by-grain basis, the thermal emission from dust has a Planck 
function SED B^, defined by dust temperature T, modified by an emissivity 
function while at the longest wavelengths, scattering theory indicates that 
€ (X with (3=2, while at the shortest wavelengths, ~ 1. The observed 
value from fitting the SEDs of galaxies at 1 — 0.1mm is of order 1.3 — 1.5 
[107]. This single composite value reflects complex geometry, variable optical 
properties of the grains, and an inevitable distribution of dust temperatures. 
At shorter wavelengths, grains at temperatures less than 100 K emit on the 
Wien side of the spectrum, and an exponential SED is expected. This will 
be modified by small grains heated out of equilibrium by absorbing single 
high-energy photons before cooling radiatively back to an equilibrium tem- 
perature of 10 — 100 K. Also, the small fraction of grains very close to hot stars 
and AGN will be heated to higher temperatures than grains in the general 
ISRF, and emit at shorter wavelengths. Galaxies are observed to have mid-IR 
SEDs closer to a power-law than an exponential function, due to the summed 
emission from these hotter components (Fig. 1). Dust sublimes when heated 
to temperatures of order 2000 K. At the corresponding wavelength of about 
2 /rm, low-mass starlight starts to dominate the SED, and so the inevitable 
Wien tail of the near-IR dust spectrum from the hottest grains present at 
higher frequencies is masked by stellar emission. 

The far-IR SED of a galaxy is the sum of many different components. 
However, as most galaxies are described only by up to ten data points all 
the way from mid-IR to millimetre wavelengths, it is reasonable at present to 
reduce the number of parameters and components involved to about three. 
If these parameters are well chosen, then the results will be physically rel- 
evant. We adopt the simplest parametrization permitted by the data here. 
At long wavelengths, the SED is described by cx in the mid-IR 

waveband, a power-law spectrum with fn oc v°‘ is assumed. These functions 
are grafted together at the frequency where the logarithmic gradient of the 
long-wavelength SED B^) /dv, steepens beyond a. As shown in 

Fig. 1, this can provide a reasonable description of diverse observed galaxy 
SEDs. Without including the power-law spectrum in the mid-IR, predicted 
SEDs are too steep to agree with observations. 

In order to reflect the general features of the mid-IR SED of a galaxy made 
up of dust clouds at various temperatures, or to describe the summed emission 
from many galaxies at wavelengths from about 200-20 /rm it is reasonable to 
describe the spectra of the individual components as S functions, which makes 
the process of integrating over the different components very easy. A more 
detailed description is only called for at wavelengths that extend beyond the 
range of peak wavelengths of the SED [44] . 

The value of temperature T has a clear meaning. It determines the posi- 
tion of the peak of the SED, and reflects a luminosity- weighted temperature 
for the dust in the galaxy. f3 includes some information about the physics 
of dust grains, from their long-wavelength spectra, and about the relative 
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Fig. 1. Various observed multi-wavelength SEDs of the dust spectra of luminous 
galaxies, both nearby IRAS galaxies (I), distant submillimetre-selected galaxies (S), 
high-redshift lensed galaxies (L), and high-redshift AGN (H). Three model SEDs 
are shown for comparison [56,143,107], derived from fits to submillimetre and far- 
IR data, from fits to composite IRAS galaxy SEDs [91], and from submillimetre 
observations of low-redshift IRAS galaxies respectively. With the exception of some 
of the mainly lensed AGN, the templates provide a reasonable description. One 
of the models {solid line) includes GO line emission expected at submillimetre 
wavelengths, and another {dashed line) includes polyaromatic hydrocarbon (PAH) 
emission/absorption features at mid-IR wavelengths. Note, however, that the less 
luminous Milky Way has a temperature of order 20 K, approximately half of the 
40 K assumed in the templates of luminous galaxies, while the SED of the extremely 
luminous lensed QSO APM 08279-1-5255 [152] has a thermal spectrum with T ~ 
110 K. A wide range of dust temperatures is clearly present in galaxies. 



amounts of dust present at temperatures much less than T. a reflects rel- 
atively directly the fractional contribution from much warmer grains. Some 
very luminous AGN at high redshift clearly have a very flat mid-IR spectrum 
[175]. This implies that they have a very significant contribution from hot 
dust close to an AGN, or possibly an intense, but very inhomogeneous ISRF. 

Similar quality fits to data can be obtained by representing the SED of a 
galaxy using 2 components with different dust temperatures [108]; however, 
it is unclear how best to determine these temperatures and whether they 
have any direct physical relevance to a two-phase ISM [53] . 
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2.5 Dust Mass and Temperature 

The mass of dust in a galaxy is relevant for understanding its chemical evolu- 
tion prior to the observation. The creation, destruction and recycling of dust 
are important for understanding both the process of star formation, and the 
feedback of processed metal-rich material into the ISM, in all galaxies. This 
is a potential advantage of studying galaxy evolution from dust emission. 
First, the currently active star-forming regions are picked out clearly by their 
powerful dust emission, without any delay as stars to burn out of their dust- 
enshrouded nurseries to become visible at optical wavelengths. Secondly, the 
dust mass should give a direct measurement of the amount of heavy elements 
that have formed earlier, subject to few uncertainties about stellar astro- 
physics or the fraction of metals sequestered in stars, which can be probed 
using stellar spectroscopy. 

The geometry and size of dust grains is very significant for mass deter- 
minations, but less so for the general appearance of the galaxy SED. Dust 
grains, regardless of size or composition, have a high optical/UV opacity, and 
thus absorb efficiently, re-emitting the radiation at similar temperatures, with 
the caveat of small grains emitting briefly out of equilibrium in the mid-IR. 

It is not clear whether dust is composed of spherical, filled grains, or 
more exotic strongly ellipsoidal objects, whiskers, miniature rubble piles, or 
colloidal ‘snowflake’ structures. All but the first of these geometries could sig- 
nificantly reduce the dust mass required to produce a certain optical effect. 
If, for example, high-redshift dust is composed of delicate networks of carbon 
nanotubes in ‘snowflake’ grains, then the mass of carbon required to generate 
sufficient absorption to obscure early generation of stars could be very mod- 
est, as compared with the expectation for filled spherical ‘hailstone’ grains 
at low redshifts in the Milky Way. This would have a very significant effect 
on calculations of the prior cumulative amount of star formation activity re- 
quired to generate the observed opacity in high-redshift QSOs [32,153,201]. 
Uncertainties in the geometry of dust grains can also affect the likelihood of 
their destruction, for example by electrostatic stress built up by photoelectric 
emission by X-rays [129]. 

It is likely that moderate spectral-resolution observations of grains in the 
10 /rm wavelength range in the Milky Way and nearby galaxies could detect 
signatures of vibrational and loosely-bound electronic transitions in grains, 
perhaps revealing this type of information. However, it is unlikely that the 
evolution of dust properties could be studied at high-redshifts, as the spectral 
signatures would be too faint to detect. 

In any case, it is important to be aware of the potential for determin- 
ing some estimate of the dust mass from multi-wavelength observations of 
a galaxy SED between about 1 mm and 10 /im. As a galaxy is likely to be 
almost transparent to dust radiation, the total luminosity should be linked 
to dust mass, subject to the size of the grains a. The emissivity of a grain 
is linked to its area, which depends on a^. Mass depends on a^, and so the 
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specific emissivity e oc a“^: a cloud of dust is a better emitter than a cloud of 
pebbles with the same mass. It is also a better absorber. The interaction of 
radiation and dust is much more complex [103], but this is the level at which 
it is typically treated by extragalactic observers. 

The detected flux Si, from a cloud of dust with a certain dust grain size 
distribution should depend on the total mass M and the SED fi, of the 
cloud as S' oc f,y(T)M, while the luminosity L oc M jfydv oc . 

In the Rayleigh-Jeans regime, typical of submillimetre-wave observations, 
fu oc and so S oc MT. This means that for a certain measured 

long-wavelength flux density, the mass is determined as accurately as the 
temperature. However, the luminosity L cx , and is very poorly deter- 

mined without a measurement of the flux near the peak of the SED to fix the 
temperature accurately [109]. An example of the values of the parameters T 
and (3 required to fit a real dataset are shown in Fig. 2. Near the peak, the 
form of the SED depends weakly on temperature, and so mass and luminosity 
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Fig. 2. An illustration of the uncertainties in fitting dust spectra to limited SED 
data. The left-hand panel shows some of the best data available from SCUBA and 
IRAS, for NGC958 [108]. The right-hand figure shows the degeneracy in fitting a 
2-parameter SED model defined by a dust temperature T and emissivity index f3 
to the 4 data points. A fixed mid-IR index a = —1.95 is assumed; if a was also 
allowed to vary, then there would be more uncertainty in the permitted SED models. 
However, note that a is determining reasonably accurately from the spectral index 
between the 60— and 100 — /rm IRAS points. There is a great deal of freedom to 
exchange hotter T values and lower /3 values. The SEDs that correspond to both 
ends of the probability contours in the right-hand panel are shown on the left by 
the solid and thin dashed lines respectively. The 450 — /rm data point was obtained 
most recently. Without it, the thick dashed curve provided the best fit to the data 
[107]. Adding additional data points near the peak of the SED would reduce the 
width of the probability contours, but not remove the degeneracy between T and 
P (see [53]). 
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can be constrained more accurately. The effects of redshifting the spectrum 
are discussed below. 

In Fig. 3 the effects of assuming different functional forms for the SED 
are compared. An SED that includes an opacity [32], two different dust cloud 
components [108], and a power-law distribution of dust masses and temper- 
atures [91] are compared with the SED model introduced above. As can be 
seen from the caption, the value of temperature required to obtain a good fit 
to the data differs by a small factor. However, the simplest model can provide 
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Fig. 3. A comparison of the best-fitting models for the SED of NGC 958 (see Fig. 2), 
extended to include additional data from the NED database. Four template SEDs 
are included. The one described above [25], with T = 29K, /3 = 1-5 and a = — 2 . 1 . 
The value of j3 is assumed to be fixed at 1.5, and the other two parameters were 
fitted. One includes an optically thick dust cloud [32] with an emissivity function 
1 — exp(i^/r'opt)^. t'opt is assumed to be 2.4 x 10^^ Hz, = 1.5 and the best-fit 
temperature T = 33 K. The choice of the value of Vo^t within reasonable bounds 
has very little effect on the result. Another model includes two clouds of dust at 
different temperatures [108]: the cooler cloud is assumed to be at 20 K, emissivity 
indices = 2 are imposed on each cloud, and the best fit is obtained when the 
hotter cloud has T = 30 K and contains 12.6% of the dust mass of the cooler one. 
A mid-IR SED slope a = —2.1 is assumed in all three models. The final model 
includes a power-law distribution of dust temperatures [91], with m(T) oc with 
7 = —7.75 between temperatures of Tmin = 22K and 2000 K, an emissivity index 
P = 1.5 with Tmin and 7 determined by the data. Assuming the standard form 
of the far-IR-radio correlation [85] all models accurately reproduce the observed 
1.4-GHz flux density of NGG 958. 
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a good representation of the data, and it is not necessary, given the current 
data with low spectral resolution to be concerned with the effects of opacity 
and different dust components on the results. 

In order to investigate the SEDs of dusty galaxies in detail, spatially re- 
solved observations, capable of revealing all the star-forming regions with 
various ages, and the contribution from AGN, complex geometries and opac- 
ity distributions are required. Since these are not yet available, a reasonable 
description of the SED of a dusty galaxy can be provided by a dust tem- 
perature. That sets the position of the peak of the SED, and a distribution 
of dust clouds at hotter temperatures that contribute to an approximately 
power-law spectrum defined by a at shorter wavelengths. 

2.6 Long- Wavelength Observations of Low-Redshift Galaxies 

The only galaxies for which we have accurate and detailed SEDs that extend 
over the complete spectral range from millimetre-wave to near-IR wavelengths 
are at very low redshifts. This includes the Milky Way, which was mapped in 
detail by IRAS, and the FIRAS and DIRBE instruments aboard the CORE 
satellite [219]. The importance of dust emission as a contributor to the lu- 
minosity of typical low-redshift galaxies was first established by IRAS at 12, 
25, 60 and 100 ^m, yet the most luminous galaxies detected by IRAS lie 
at redshifts of only about 0.2 or 0.3. The SEDs of galaxies like the Milky 
Way typically peak at around 200 /xm, while hotter galaxies more typical of 
those found in the IRAS survey peak at wavelengths between 60 and 100 /xm 
(Fig. 1). Hence, the temperature of the dust in IRAS galaxies can be deter- 
mined only from the 60— and 100 — ^m channels. In fact, because these bands 
both lie either close to the peak of the SED or on its short wavelength side, it 
is difficult to extract detailed information about dust temperature, especially 
for cooler galaxies (Fig. 2). The IRAS data imposes a maximum permitted 
temperature, to avoid generating excess flux in the 25 — /xm band. The tem- 
perature of cooler galaxies with SEDs that peak at wavelengths longer than 
the 100 — /xm IRAS band are difficult to constrain without data at longer 
wavelengths. 

A subset of galaxies detected by IRAS were subsequently selected for ob- 
servation in pointed observations using the ISO satellite in the mid 1990’s. 
These observations were made using a variety of imaging and spectroscopic 
modes, at wavelengths over a wider range than IRAS, covering from 5 to 
200 ^m. Deeper imaging surveys using ISO [164,218,7,114] found new popu- 
lations of dusty galaxies at greater distances than IRAS, providing significant 
information about the evolution of relatively low-redshift dusty galaxies. 

At longer submillimetre wavelengths, information is still relatively scarce. 
Using the first generation of single-element submillimetre-wave bolometer 
photometers, IRAS galaxies were detected in the 1990’s [10], providing infor- 
mation about their SEDs at longer wavelengths. The commissioning of the 
SCUBA 450/850 — /xm camera with a 5-arcmin^ field of view at the JCMT 




18 



Andrew W. Blain 



provided opportunities for studies of the long-wavelength emission from a 
larger sample of IRAS galaxies [107,179,108]. The typical temperature of 
this IRAS-selected sample [107] is T = 38 ± 3K, with an emissivity index 
(3 ~ 1.3 ±0.2. Observations of more luminous low-redshift galaxies [179] tend 
not to indicate a dramatic luminosity dependence of the SED, other than to 
note that spiral galaxies with infrared luminosities of order several 10^*^ Lq 
have SEDs with T ~ 20K, whereas more luminous 10^^ — Lq galaxies have 
T ~ 40 K. 40 K also seems to be the typical temperature of many high-redshift 
submillimetre-detected AGN [32,201,153]. 

However, there are exceptions. The nearby irregular galaxy M82 has a 
well-determined dust temperature T ~ 40 K, despite a relatively low lumi- 
nosity of about 3 X 10^° Lq [53]. It is clearly involved in a short-lived burst of 
star-formation activity, however, and so conditions within are perhaps likely 
to be more similar to those in more luminous galaxies. Also, some moderate 
and high-redshift AGN have much higher dust temperatures [93,175]. This 
is likely to represent a significant fraction of the luminosity of these galaxies 
being generated either close to the accretion disk of an AGN, or from a re- 
gion of a galaxy in which the ISRF is at least an order of magnitude more 
intense than in cooler 40-K objects. It is not yet possible to be sure of the 
true distribution of dust SEDs in high-redshift galaxies, and some 175 — /rm 
hSO-selected galaxies have been found with very modest dust temperatures 
close to 30 K [76]. Hopefully, the large-format, very sensitive images expected 
from the MIPS instrument on the forthcoming SIRTF satellite will provide 
the necessary information at wavelengths of 24, 70 and 160 /im, spanning the 
peak wavelength of any reasonable galaxy SED. The SINGS SIRTF Legacy 
Project‘d will generate detailed information for about low-redshift galaxies, 
while a variety of deeper surveys will investigate the evolution of galaxies 
and enable cross-correlation studies with a large sample of optically-selected 
galaxies. 

2.7 Absorption Studies 

Optical: The first technique to study ordinary galaxies in the high-redshift 
Universe was observations of absorption lines against the intense optical ra- 
diation from QSOs. While the emission from these galaxies, which can be op- 
tically thick and very faint, is difficult to detect, they can be clearly detected 
against bright background sources by way of saturated absorption features, 
in some cases with exquisite signal-to-noise ratios [117]. These studies began 
as soon as the first high-redshift QSOs were identified. Gunn & Peterson 
[144] noted in 1965 that the existence of detectable emission bluewards of 
redshifted Lyman-a in the spectra of QSOs indicated that the intergalactic 
medium (IGM) along the line of sight was almost entirely ionized, presenting 

Information about SIRTF Legacy Projects can be found at 

http://www.sirtf.caltech.edu . 
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no resonant scattering cross section. Only this year has the first evidence for 
such absorption been presented, at z > 6 [30,99]. The IGM along the line 
of sight to distant galaxies is ionized; while discrete dense, neutral systems 
provide Lyman-a absorption lines in QSO spectra, with the most spectacular 
results coming from the brightest illuminating sources [117]. In principle, the 
abundance and density of the absorbers can be used to trace the evolution of 
galactic systems that contain cool neutral gas, subject to possible selection 
effects, such as bias against including QSOs with significant foreground ex- 
tinction in surveys [118], which now seems relatively unimportant based on 
samples selected with no foreground extinction in the radio [116]. 

The evolution of the cosmic density of neutral gas that results from inte- 
grating the whole population of Lyman-a absorbers is consistent with most 
estimates of the rate of gas consumption. However, it is difficult to disen- 
tangle the role of time-dependent effects: the consumption of gas in stars, 
its dispersion in stellar winds, the formation of new galaxies and the assem- 
bly of reservoirs of cold absorbing gas in pre-existing dark-matter haloes. At 
all observed redshifts, the abundance of the most massive, gas-rich systems, 
the damped Lyman-a systems, with hydrogen column densities greater than 
10^°cm“^ seems to be rather similar [213]. The details of these optically- 
based studies are far beyond the scope of this paper, but it is important to 
remember that optical studies of cold gas in the high-redshift Universe can 
be made. 



Gamma- Ray Bursts: One of the most exciting prospects is extending stud- 
ies of absorption line systems to include objects silhouetted against the in- 
tense short-lived optical flash/afterglow emission of high-redshift 7-ray bursts 
(GRBs). GRBs are thought to trace the formation rate of high-mass stars, 
on the grounds of their apparent association with iron X-ray absorption fea- 
tures, typical of the debris from supernovae, supernova-like features in some 
of their light curves [220], and their common association with irregular blue 
host galaxies [98]. As a result of their exceptional brightness, GRBs should 
be detectable out to the highest redshifts, and as 7-rays are not subject to 
absorption by gas or dust, GRBs even a,t z ^ 6 should be readily detectable 
at MeV-GeV energies, regardless of foreground structures, including galaxies 
with a dense neutral ISM. The associated optical/UV afterglow emission from 
such a GRB will then illuminate the full redshift range over which galaxy for- 
mation has taken place. Particularly promising is the prospect of using the 
emission from the extremely bright, very-short-lived, prompt GRB optical 
flash that sometimes occurs (U ~ 9 for 15s from z = 1.2 GRB 990123 [227]), 
to probe absorbers with column densities too low for detection otherwise. 

Unlike QSOs, which ionize the surrounding IGM and modify the ioniza- 
tion state of the ISM in galaxies up to about 1 Mpc away, causing a deficit 
of absorption lines at redshifts close to the QSO, GRBs would not suffer a 
‘proximity effect’ [231]. The proximity effect is useful for estimating the inten- 
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sity of the extragalactic radiation field at the redshift of the target QSO, but 
prevents detailed studies of nearby galaxies, that could perhaps be members 
of a group or cluster surrounding the QSO. Only the physical conditions in 
the very local pc-scale environment within the host galaxy of the GRB would 
be affected by a stellar progenitor. 

The drawback of GRB absorption studies is the single opportunity to 
observe them, with no prospects of a later confirmation. Instrumental setup 
and acquisition must be rapid and reliable in order to avoid missing the ac- 
tion, an important consideration when minutes are required to read out data 
from the largest GGD cameras. A few GRBs per thousand are expected to 
be gravitationally lensed into multiple images by foreground galaxies, groups 
and clusters. Each image would be magnified by several times. With time 
delays from minutes to months, these lensed GRBs could provide a great op- 
portunity for observers to be waiting for the arrival of each subsequent image, 
so that the light curve of the same GRB can be studied again, including its 
earliest rise. Tensing deflections of photons are sufficiently small that even 
the most tightly beamed GRB should still be visible along the different lines 
of sight. 

Studies of the submillimetre and far-infrared emission from the host galax- 
ies of GRBs also offer interesting prospects for investigating the evolution of 
galaxies: see later. 



Millimetre/ Submillimetre Absorption Studies: A comparable absorp- 
tion probe has been used at millimetre wavelengths to detect absorption by 
sometimes exotic molecules against the strong radio continuum emission of 
background radio-loud AGN. Only a handful of candidates have been de- 
tected, along the line of sight to gravitationally lensed AGN where the tens- 
ing galaxy is within about 1 arcsec of the background radio source [268] . The 
small number of detections is due to a combination of the need for deep in- 
tegrations and bright background sources. Because the molecular gas in a 
typical galaxy is much more centrally concentrated than neutral hydrogen 
- the scale lengths differ by about an order of magnitude - the area on the 
sky intercepted by molecular absorption systems is smaller than for Lyman-a 
systems. Additionally, only a single line in an absorbing object can be ob- 
served at once, as the spectral bandwidth of existing instruments covers only 
a narrow spectral region < 1 GHz wide, corresponding to a a very narrow 
redshift interval at observing frequencies about 100 GHz. This is in contrast 
to the ability to resolve lines from about 40 to llOnm, Avjv ~ 1, in an 
optical QSO Lyman-a absorption spectrum. 

When a wider band is available using ALMA, and wide-band spectro- 
graphs designed for detecting molecular-line redshifts are in service on single- 
antenna millimetre-wave telescopes, it may be possible to observe a ‘forest’ 
of intervening GO-line absorbers against the radio continuum emission of 
more distant radio-loud AGN, while searching for GO emission/absorption 
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from the AGNs themselves. The limited spatial extent of the absorber, and 
the relatively narrow spectral range is still likely to render millimetre-wave 
CO absorption line studies less significant than optical absorption studies for 
understanding galaxy formation; however, they are sure to be valuable for 
studying low-excitation, representative, high-redshift galaxies, and ordinary 
regions of the ISM in fortuitously aligned galaxies. 

3 Evolution of Low-Redshift Galaxies 

3.1 Description of Galaxy Surface Densities 

At this point it is necessary to discuss some of the descriptive terms required 
to analyse the evolution of galaxies. First, a statistical luminosity function 
(p^{L,z) at an observing frequency v, which describes the space density of 
galaxies with luminosities between L and L + dL in the redshift interval 
between z and z -I- dz. It is possible to define L as the total luminosity of a 
galaxy, in W or Lq, or as the luminosity in a frequency range centred on v, 

in WHz“^. As IRAS 60 — /rm observations were made at a frequency close 
to the peak of the SED of a typical dusty galaxy, the luminosity of this class 
of galaxy is often described by vL^, a quantity with dimensions of power, 
that differs from the true luminosity L = f diy by a factor of order unity. 

Luminosity functions can typically be described by a Schechter function 
form [229], 

<P{L) = <Po exp(^-^^. (1) 

Whether the luminosity function is defined per unit linear or logarithmic in- 
terval is largely a matter for personal taste. As luminosity functions span 
many orders of magnitude it is usually sensible to use a logarithmic form in 
numerical integrations to yield observational quantities. If logarithmic defini- 
tions are used, then it is conventional to divide by a factor of L in the defini- 
tion above. The density of galaxies as a whole is such that ~ 0.1 Mpc“^, 
L* ~ IQiiLo and a ~ -1.1. 

The form of the Schechter function is intrinsically interesting. There is 
little reason to expect a power-law tail at low luminosities - galaxies could 
have a preferred luminosity. The low- luminosity power-law behaviour, with 
a > —2 at small L to ensure that f LI>{L) dL converges, could be indicative 
of strong interactions and feedback in the star-formation process. Power- 
law distributions describe self-similar, self-regulating behaviour observed in 
complex interacting natural systems, such as the distributions of wave heights 
or the dimensions of forest clearings. The exponential cutoff could then clearly 
indicate a natural scale that the process has yet to reach, and on which 
luminous/massive objects have yet to form. The luminosity function of dusty 
60 — /rm IRAS galaxies [247,228] (Fig. 4) is unusual, having a quadratic not 
exponential high-luminosity cutoff. This is unlike any other population of 
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Fig. 4. The luminosity function of low-redshift galaxies detected by IRAS at 60 /rm. 
The data points and thin solid line correspond to different determinations [247,228]. 
The difference is only significant at the faint end. The observed $ oc L~^ high- 
luminosity tail of this luminosity function is unusual, and almost certainly reflects 
the appearance of a population of strongly evolving luminous galaxies that are 
the first sign of the abundance of such objects in the high-redshift Universe. This 
luminosity function, consisting of both very luminous merging galaxies at the high- 
luminosity end, and ordinary spiral galaxies at the low-luminosity end is an im- 
portant benchmark that successful models of the properties of high-redshift dusty 
galaxies must reproduce. The overplotted curves show Press-Schechter luminos- 
ity functions, assuming a constant mass-to-light ratio and different values of the 
fluctuation spectral index n or 7 (Sect. 4.7). 



galaxies, and seems to indicate that the most luminous of all local galaxies 
are relatively brighter using IRAS than at other wavelengths, presumably 
due to their greater tendency to be obscured by dust at optical wavelengths. 

The luminosity function can be the sum of a variety of different galaxy 
types, or a grand multivariate function describing the whole population; how- 
ever, the subpopulations also seem to be described individually by Schechter 
functions. To determine a luminosity function requires knowledge of the red- 
shifts of the galaxies in a survey, so that their detected flux densities S,y, in 
units Wm“^Hz“^ can be matched to their luminosities,® and the volume in 
which they could be detected can be calculated to construct the function. 

The observational quantities that follow from the luminosity function are 
defined in terms of distances. We will return to define cosmological distances 
in Sect. 4.4. 

® lWm“^Hz“^ is equivalent to 10^® Jy (Jansky) in radio-based units. 
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Counts of Galaxies: The cumulative counts of galaxies on the sky brighter 
than a flux density S^, in units of sr“^, are given by 



N{> S,) 




<PD{r')^ dL dr' 



dr') 



(2) 



dr' is the radial distance element in the Universe in comoving coordinates, and 
D is the comoving distance parameter, with a functional form that depends 
on the geometry of the Universe, rmax is the distance to the most distant 
galaxy. The equation can easily be transformed into an expression for redshift 
0 , as r{z) is easy to determine (see Sect. 4.4). Strong observational evidence 
now constrains the geometry of the Universe [94], removing much of the 
capacity for different authors to use different cosmological parameters when 
interpreting observed results. The integral includes all the galaxies that are 
brighter than a flux density 



S. 



L fu{i+z) 

+ z) j dv' ’ 



( 3 ) 



where L is the bolometric luminosity and represents the SED. The factor of 
1 + z in the denominator takes account of the redshifted observing bandwidth, 
and the reductions in both photon energy and arrival rate due to cosmology. 
The sampled restframe frequency of emission i>{l + z) is blueshifted from the 
observed frequency. Tmin in (2) can be found by evaluating L in (3), and 
may differ between sub-populations, depending on the SED. It is possible to 
impose multiple constraints on the properties of galaxies that contribute to a 
count. For example, if the count of galaxies that exceed chosen flux densities 
at two different wavelengths are required, then the values of Emin for each can 
be calculated, and the most restrictive then applied to calculate the integral. 

At low redshifts and for bright sources, the integrals can be evaluated 
immediately: N(> S^) oc Si, for any luminosity function. As the counts 
are pushed to greater depths, the change in their power-law slope indicates 
the presence of evolution in the galaxy population. If they rise above the 
—3/2 slope, then evolution is positive, if they fall below, then it is nega- 
tive (see Fig. 5, in which the derivative of a count is plotted with this slope 
subtracted). When large numbers of sources are available, this differential 
count dfV(> S,^)/dSi, can be determined (Fig. 5), avoiding the cumulative, 
non-independent nature of the integral count N(> Si,). 



Background Radiation Intensity: Another important observable quan- 
tity is the total amount of background light Ii, contributed by a population 
of galaxies. This can be obtained at any wavelength by integrating over the 
count distribution, or directly from a luminosity function. 



I. 



Js„dN{> Si,) 




L<P fi,{i+z) 

47t(1 + z) J fi,' diz' 



dLdr' , 



( 4 ) 
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Fig. 5. Counts of galaxies determined at 60 fim from various IRAS surveys, summa- 
rized most recently by Bertin et al. [40]. Deep surveys at the different wavelengths 
of 15, 90 and 175 /rm using ISO have extended the evolution baseline in redshift 
from 2 = 0.1 to 2 ; ~ 1, but prior to SIRTF at 60 fim, the IRAS results remain 
the deepest survey of the Universe. The plotted form of the counts ensures that 
Euclidean counts N{> Sv) oc 5“ with a = —3/2, expected in cases where there is 
no evolution of galaxies, no cosmological factors in distances and a geometrically 
flat Universe, lie horizontally in the figure. That the faint counts rise significantly is 
clear evidence for evolution in the population. The two curves for evolving models 
(solid and dashed lines) correspond to the best-fitting values shown in Fig. 6. Cos- 
mological factors and the effect of redshifting the SED naturally leads to a downturn 
in non-evolving counts in this plotting format, as shown by the dotted line. 

in units of Wm“^ Hz“^ sr“^. A lower limit to the background radiation in- 
tensity, which may or may not be useful, or comprise a significant fraction of 
the total, can always be obtained by summing over a measured count. For the 
deepest optical, near-IR and submillimetre-wave surveys, these limits are in- 
teresting, and comparable to absolute background measurements determined 
from the brightness of the sky after subtracting the emission from known Zo- 
diacal and Galactic components [259,38,55,88]. If the observations are made 
from within the atmosphere, then the atmospheric emission must also be 
subtracted. 

From IRAS the limit to the background density that can be obtained by 
summing over the 60 — /im counts brighter than 30mJy (Fig. 6) is about 6.3 x 
Wm“^ sr“^, only 5% of the total emission from all external galaxies 
at this wavelength. Hence, it provides a very weak limit on evolution, other 
than confirming that most of the background radiation at this wavelength 
is generated by galaxies too distant for IRAS to have detected. Since IRAS 
flew, constraints that are tighter by a factor of about 3—10 have been imposed 
by ISO surveys at longer and shorter wavelengths (see Fig. 16). 
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Fig. 6. Constraints on the form of the evolution of the luminosity function of galax- 
ies at 2 ; < 1 from IRAS [40] and ISO 175 — /rm data [164,165]. The left-hand panel 
shows the results of fits to IRAS data, represented by solid contours, showing fit 
qualities of = 1,2, 3, 4 and 5, and ISO data, represented by dashed lines cor- 
responding to reproducing the data [164], and for missing by factors of 0.5 and 
2. This is carried out in an ‘anvil model’ ]56] in which luminosity evolution has a 
(1 -I- zY form out to a redshift greater than that sampled by these populations of 
galaxies. The right hand figure shows the joint constraints imposed, and compares 
the effect of a ‘peak model’, an alternative parametrization in which the (1 -I- zY 
luminosity evolution flattens off smoothly at a ~ 1.5. The effect of choosing these 
slightly different forms of the evolution functions is not significant. The evolution 
of low-redshift dusty galaxies out to 2 ~ 1 is strong, and matches the form derived 
from optical data ]177]. 



Redshift Distribution: The redshift distribution of galaxies brighter than 
a specified flux density or subject to some other chosen condition can be 
determined by evaluating (2) without integrating over z. The redshift distri- 
bution of a carefully selected sample of galaxies is the most useful quantity 
for determining the form of evolution. In general, a redshift distribution pro- 
vides more information than the count from which it is drawn, while a deep 
count in turn provides more information than a determination of the back- 
ground intensity, which is undifferentiated by galaxy flux density. In terms 
of observational effort, finding a redshift distribution is demanding: accurate 
positions must be determined and follow-up spectroscopy carried out. How- 
ever, the need for the careful analysis of systematic effects and the accurate 
determination of the contribution from unrelated sources of emission can also 
make background observations challenging [217,38]. 



Fossil Records: Alongside the radiation produced by luminous galaxies, 
remnants of the activity are also generated. For example, metals are generated 
by high-mass star formation activity, and distributed into the ISM of the 
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host galaxy with a ~ 40% efficiency at the end of stellar evolution. The 
number of low-mass stars still burning that formed at high redshifts can be 
calculated, assuming a stellar initial mass function (IMF) [171], and models 
of stellar evolution. When compared with the observed abundance of stars 
in the Milky Way, or the integrated colours and densities of local galaxies, 
then this provides a useful but weak constraint on the high-redshift IMF 
in advance of huge 100-m-class near-IR/optical telescopes that can resolve 
individual stars directly in a representative sample of galaxies extending as 
far as the Virgo cluster. Accretion power onto AGN instead generates massive 
remnant blackholes in galactic centres. 

The density p of any remnant from the galaxy formation process, either 
long-lived low-mass stars, heavy elements or stellar remnants - white dwarves, 
neutron stars and low-mass blackholes - can be found by integrating the 
energy production rate or ‘luminosity density’ pn = jL^PdL within a fixed 
comoving volume over cosmic time, 

1 1 dr 

p = K p^{t)dt=- T{z)-—-dz, (5) 

Jo c Jo 1 + zdz 

where the value of K determines the link between luminosity of the sources 
Pl and the generated material. 

For low-mass stars, K depends on the stellar IMF, with 

/*A^ex / /*-^max 

K (X [ IMdM / [ IMdM , (6) 

where Mmin and M^ax are the lower and upper mass limits of the whole 
IMF, and Mex is the mass of the most massive star still burning at the 
present epoch, which depends on its formation epoch and theories of stellar 
evolution. For nucleosynthesis, K ~ (0.007c^)“^, reflecting the efficiency of 
fusion energy conversion, which is about 0.5-0. 7% of the rest-mass energy of 
hydrogen when converted into helium, or iron respectively. For AGN, K ~ 
e“^, where e is the potentially much greater accretion efficiency (~ 30%). 
The typical IMF and the accretion efficiency could be quite different at high 
and low redshifts, and so the results might be difficult to interpret. 

This type of integral constraint can, however, impose useful limits on the 
amount of activity at high redshifts. For example, the ratio between black- 
hole and stellar bulge mass in elliptical galaxies and spiral bulges implies 
that the relative amount of energy emitted by accretion as compared with 
star-formation activity was of order 1:5 [51], regardless of cosmology, if the 
processes are assumed to be coincident in time, and so pl(^) has the same 
form in both cases. Of course, this need not be true. The link between black- 
hole and star formation is very important. If this Magorrian relation is the 
result of a series of coincidences, then it is an important goal to list them all 
and to determine their respective roles in the ‘conspiracy’ to fix this ratio. 
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It seems unlikely that no deeper link exists, reflecting an important part of 
the galaxy formation process [239]. However, another remarkable result, the 
existence of the fundamental plane of elliptical galaxy properties [97], has 
remained an interesting but rather ill-explained feature of our understanding 
of the formation of elliptical galaxies for many years. The realization that 
elliptical galaxies have an unexpectedly narrow range of properties has not 
allowed the process of their formation to be determined absolutely. 

3.2 Detecting Evolution in the IRAS Survey 

From the IRAS survey it was clear that comparable amounts of energy were 
being released by both nearby galaxies and the Milky Way, at optical and 
far-IR wavelengths. Hence, it was clear that far-IR luminous galaxies could 
make a significant contribution to the energy budget of the Universe, if this 
continued out to high redshifts. However, IRAS did not reach out to cosmo- 
logical redshifts®. It is interesting to review the evidence for strong galaxy 
evolution available from IRAS results in the late 1980’s, but we must first 
put the studies into context. 

Although deep galaxy surveys had been made at optical wavelengths since 
the advent of CCD detectors around 1980, the type of evolution of the lu- 
minosity function was not evaluated effectively until large galaxy redshift 
surveys were carried out in the early/mid 1990’s [177]. The results from op- 
tical redshift surveys at z < 1 were obtained at almost the same time as the 
deepest optical HDF-N image of the distant Universe was taken. Analysis of 
the colours of the faint galaxies provided approximate ‘photometric’ redshift 
information, backed up by campaigns of spectroscopy at the 10-m Keck tele- 
scope, confirmed that the luminosity released by galaxies in the high-redshift 
Universe is greater by an order of magnitude than at low redshift [187,86,273]. 

These results tend to be supported by subsequent observations, but a 
less dramatic form of evolution has been claimed from redshift surveys that 
select galaxies at several optical wavelengths, not just the /-band [87]. Ob- 
servations of AGN had already demonstrated strong evolution of the number 
of double-lobed radio galaxies [104], and optically-selected QSOs [119], con- 
clusions based on count data and backed up by reasonably complete redshift 
distributions for the sources. At the time of their publication, details of the 
evolution of /RAb'-selected galaxies, were consistent with the evolution in- 
ferred from AGNs to much greater redshifts [200]. 

How can the observed counts and backgrounds be linked to the form of 
evolution of galaxies? There are two conventional forms of evolution, which 
empirically describe the evolution of luminosity functions ((1)). One is ‘pure 
density evolution’, in which <Pq is a function of redshift but L* is constant. 
This ensures that the evolution of both the space and luminosity density of 



Even in 1985, when only z = 0.5 being a high redshift, and the only more distant 
objects with redshifts were radio- and optically-selected AGN. 
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galaxies, j4>dL and f L<l>dL respectively, is described by n{z) = <Po{z)/^o{^)- 
This simple scaling feeds through to boost both counts and backgrounds. An 
alternative is ‘pure luminosity evolution’, in which is fixed and L* is a 
function of redshift, with the evolution function g = L*(z)/L*(0). This in- 
troduces the same dependence of the luminosity density on redshift, when 
the entire luminosity function is integrated, but leads to an enhanced lumi- 
nosity density of galaxies brighter than a certain detection threshold. As a 
result, the two forms of evolution affect the counts in different ways. The ad- 
ditional factor that shifts the detection threshold means that a certain factor 
of luminosity evolution has a more dramatic effect on the counts than the 
corresponding factor of density evolution. If the evolution functions are rep- 
resented by (1 -I- zY, then the value of p required to fit count data assuming 
density evolution is approximately twice that for luminosity evolution. The 
effects of including luminosity evolution of the IRAS luminosity function for 
explaining the counts of galaxies detected using IRAS and ISO are compared 
in Fig. 6. p~ 3.9 is needed to explain the data. Note that the assumed dust 
temperature affects the results (Fig. 5). A value of p ~ 3 was often quoted for 
the form of evolution of dusty galaxies at low redshift, a value which made 
sense because it matches the rate of evolution of AGN. This value provides 
a good fit if T ~ 50K is assumed. The value p ~ 4 provides a better descrip- 
tion in connection with a temperature T ~ 40K. Both p and T values are 
determined by the joint requirement to explain the observed counts of the 
more distant 175 — pm ISO galaxies (Fig. 14)’^. 

Even a simple Schechter luminosity function representing the real Uni- 
verse is sure to involve variations in both and L* with redshift as the 
Universe evolves. However, the observational consequences are such that lu- 
minosity evolution must dominate, if we are to account for the observed 
counts without either generating too much background light, consuming all 
the baryons into stars or filling the Universe with relic blackholes [56] . 

The count determined for IRAS galaxies at 60 — pm, in a conventional, 
but complex, notation, is shown in Fig. 5. Although the results vary between 
the different deep samples in different areas of sky, analysed using different 
techniques, there is clear evidence for an excess of sources at flux densities 
around 1 Jy, indicating positive evolution. By linking the results at 175 pm 
from ISO and at 60 pm from IRAS assuming a form of luminosity evolution, 
the fitted model curves shown in Fig. 5 are obtained. 

3.3 Galaxy Formation Scenarios 

IRAS observations by 1990 hinted that substantial evolution of the popu- 
lation of reasonably ordinary, representative dusty galaxies was taken place. 
However, most of the galaxies detected by IRAS were already detected in op- 
tical Schmidt camera surveys. They map the current end points of the ongoing 

^ Of course, there may be some favoritism towards p = 4 today to match the 
inferred rate of low-redshift evolution in optically-selected galaxies [177]. 
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process of galaxy formation, unable to probe the evolution over an extended 
timeline. The only galaxies then detected in representative numbers at high 
redshifts were AGN. Not until 1995 did the first spectroscopically confirmed 
samples of Lyman-break galaxies (LBGs) at z ~ 3 appear [253] . 

In 2002, the details of the process by which galaxies formed is still uncer- 
tain. There is a standard model, the hierarchical growth of structure domi- 
nated by a cold (non-relativistic) dark (non electromagnetically interacting) 
matter (GDM). A single, but revolutionary observation could change our un- 
derstanding; however, the GDM model has proved to be robust, and it has 
been modified to account for almost a decade of increasingly rapid discovery. 
Glear alternative paradigms, such as the generation of large-scale structure 
in explosions at z > 10, by sweeping up shells of originally uniform IGM ma- 
terial into a honeycomb network, or the early formation of structures on the 
largest scales from the free streaming of hot (relativistic) dark matter into 
caustic structures - ‘Zeldovich pancakes’ - on mass scales of order 10^® Mq, 
followed by their fragmentation to form galaxies, are all at odds with current 
data. In most cases, abundant high-redshift activity would heat the IGM to 
a temperature sufficient to wash out the observed GMB anisotropies [94] . An 
excellent review of these galaxy formation processes, dating back to when 
they were largely unchallenged by observations was provided by Efstathiou 
[112]. The detailed large-scale structure mapped by galaxy redshift surveys 
[83,58], which now reach out to redshifts where structure has evolved signifi- 
cantly, and certainly include representative cosmic volumes and the measured 
scales of the most significant structure in the GMB, all point to a Universe 
dominated by GDM. There has been discussion about the expected num- 
ber of small dark matter halos, predicted by GDM simulations, that are not 
apparently associated with luminous galaxies [138]; however, this concerns 
the exact nature of small-scale substructure, and not its presence or absence. 
Without a significant density of GDM, then the bound structures observed 
at z ~ 6 would have difficulty forming from the density contrast in baryonic 
matter that is measured via the GMB to be only about 10“® at z ~ 1100. 
Unless we are missing something significant, then the standard hierarchical 
GDM model seems likely to remain as the underlying description of the pro- 
cess of galaxy formation. 

The standard GDM model involves the gravitational growth of bound 
structures under the influence of an initially almost smooth density field in 
an expanding Universe. Inhomogeneities were generated by quantum fluctu- 
ations during inflation - a period of very rapid early expansion. Inflations 
makes these inhomogeneities amenable to producing a Universe like our own. 
The most natural form of inflation renders the Universe spatially flat, and 
dramatically reduces the density of exotic particles that could have been cre- 
ated at even earlier epochs, such as magnetic monopoles, consistent with their 
non-detection. 
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This picture of the early Universe is consistent with both the observed 
properties of the CMB, which maps the density of baryonic matter at ‘re- 
combination’, when the Universe cooled below the ionization temperature of 
hydrogen, and became transparent to radiation for the first time, and with the 
abundances of light elements formed by thermal nucleosynthesis about one 
minute after the beginning of the Universe and inflation: primordial big-bang 
nucleosynthesis . 

In studies of galaxy formation, these details can be taken as initial con- 
ditions. This is no longer just an assumption. Concrete images of the CMB 
from the BOOMERanC balloon-borne experiment [94,89] and the ground- 
based DASI [69], CBI [202,238] and VSA [257] interferometers, at the South 
Pole, Atacama and Tenerife respectively, provide solid predictions. The re- 
sults of the all-sky 10-arcmin resolution CMB image from the NASA Wilkin- 
son Microwave Anisotropy Probe (WMAP) satellite (launched in June 2001) 
provided an extremely accurate image of the CMB in February 2003. 

The distribution of dark matter that is consistent with CMB observa- 
tions ensures that gravitational instabilities cause the first collapse of struc- 
tures on small rather than large scales. The subsequent merging of collapsed 
structures then builds up increasingly massive objects. This process has been 
modeled in great detail using N-body gravitational simulations [207], and can 
be represented reasonably accurately by an analytic function due to Press & 
Schechter [216], extended [20], updated [237] and made more widely appli- 
cable [61] by others. The density contrast of linear overdense regions grows 
as (1 -I- z)~^ at high redshifts (Sect. 8.2). At lower redshifts, where a cosmo- 
logical constant dominates the dynamics, the growth of structure is slowed. 
Hence, as recombination took place at z ~ 1100, and the density contrast in 
baryonic matter revealed by CMB observations is only about 10“®, at first 
sight this indicates that galaxies with non-linear overdensities 3> 1 required 
for the formation of galaxies could not have formed by the present epoch. 
Note, however, that the baryonic matter is prevented from collapsing prior 
to recombination by radiation pressure on the electrons, and the electrostatic 
transfer of this pressure to the baryons. CDM does not interact electromag- 
netically, and so couples to the baryons only through gravity. This means that 
overdensities in the dark matter field are unaffected by radiation pressure, 
and can start to grow as soon as the Universe becomes matter dominated 
[193], significantly before re-ionization, and provides an important head start 
for galaxy formation, leading to deeper potential wells for the newly neutral 
baryonic matter to fall into after recombination. In light of this argument it 
seems difficult to doubt the existence of non-baryonic dark matter. 

The pre-recombination Universe is smooth and simple. The CMB provides 
a backdrop to the stage of galaxy formation; however, after recombination 
and the formation of the first objects a rich range of physical processes start 
to contribute and interact. The mix of relatively simple processes, such as 
the growth of gravitational instabilities in a matter field dominated by an 
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unknown type of non-interacting gravitating matter, the hydrodynamics of 
collapsing gas and the radiation processes that cool the gas conspire to gen- 
erate a wide range of possible outcomes. That is even before the complex 
details of the geometry of the cores of the dense gas clouds in which the 
first metal-free stars form, their stellar astrophysics, and the feedback effects 
from their perhaps exotic supernovae combine to make galaxy formation a 
complex process. 

At present it is possible to model the process of structure formation in 
CDM through the history of the Universe on scales that include both the 
sub-structure in the densest clusters of galaxies [250] and the largest scale 
structures that can be observed at the present epoch [207]. If the processes of 
gas dynamics and cooling are included, then the resolution of the best current 
N-body simulations is close to matching the scales of the dense clumps of gas 
in which the first stars formed [1,65]. The feedback effects from supernova 
from these stars is, however, a quite different story. The messy astrophysics 
after the first stars form® may never be simulated in sufficient detail to cap- 
ture the richness of the observed Universe on sub-Galactic scales [266]. This 
view may be unduly pessimistic, but it is hope that cosmic N-body simula- 
tions will be able to trace to the scales of planet formation and the origins of 
life. 



3.4 Gas Cooling and Star Formation 

At the present epoch, the regions in galaxies where stars form contain richly 
structured distributions of atomic and molecular gas and dust grains. These 
regions are dense (up to > 10®cm“®) and cool (10-100 K), and there are a 
wide variety of ways for gas, dust and radiation to interact. They contain 
significant magnetic fields, which can have large effects on the dynamical 
processes. 

These conditions are likely to be similar in recognizable high-redshift 
galaxies. While we do not understand all the details of star formation even in 
the Milky Way, it remains vital to study them, and the allied events taking 
place in nearby and distant galaxies. It is also important to bear in mind that 
the first stars generated the heavy elements whose properties now strongly 
influence the astrophysics of star formation in giant molecular clouds. The 
formation of this first generation of stars may have been very different. The 
regions where stars form in the Milky Way that can be studied relatively eas- 
ily are not the same as the high-redshift clouds in which the first generations 
of stars form. The metallicities in the Milky Way should all be significantly 
greater, while there should be more dust, stronger magnetic fields and an 
ISRF [182,1]. 

The properties of the first generation of stars are critically important for 
determining the nature of subsequent galaxy evolution, and could synthesize 



Described by Dick Bond with the double entendre ‘gastrophysics’. 
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the elements required for life, and for the study of the cold Universe. They 
are the targets for the next generation of large astronomical facilities: 30-m 
ground-based telescopes such as the California Extremely Large Telescope 
(CELT) concept, JWST, ALMA and others being designed and proposed, 
including a mid-IR 10-m-class single-antenna telescope, developed from the 
SAFIR ‘sapphire’ concept and a 1-km baseline interferometer operating at 
the same wavelength like the SPECS concept [191]. Ultimate understanding 
of the galaxy formation process requires that we understand the process of 
high-redshift star formation in detail. 

In primordial gas, magnetic fields are expected to be small, prior to the 
action of dynamos in stars, rotating partially-ionized galaxy disks, or from 
electrons accelerated in AGN-powered radio galaxies, whose magnetized lobes 
of radio emission could ultimately permeate intergalactic space [131]. As a 
result, magnetic effects should not hold up the gravitational collapse of cool 
gas on stellar scales. The only addition to radiation from hydrogen that could 
be observable is from polar DH and LiH molecules. However, their abundance 
is low, and the only source of energy available to them will be the gravitational 
potential energy of contracting low-mass objects, and so it is unlikely that 
the effects will be dramatic. 

Cooling by atomic hydrogen transitions is only effective if temperatures 
of ~ 10^ K are reached, when collisions can excite atoms into the ionized 
state required for radiative cooling. This is the virial temperature reached by 
collapsing gas clouds with masses ~ 10® Mq at high redshifts. In hierarchical 
models of structure formation, such objects do not form until z ~ 8-10. When 
ALMA and JWST probe this redshift range it may be possible to observe 
the first stars in these proto-galaxies directly. 

Molecular hydrogen vibrational-rotational transitions offer a route to cool- 
ing gas that has been heated by shocks to much cooler temperatures of several 
hundred degrees. Clouds of dark matter and baryonic gas in virial equilib- 
rium with this temperature have masses ~ 10® Mq at high redshifts, which, 
being small, could form in reasonable numbers at redshifts as high as 20. The 
background abundance of molecular hydrogen H 2 at these redshifts compared 
to atomic hydrogen Hi is expected to be of order 10“® by number [84], and 
to form by the rare interaction of free electrons and Hi to form H“ , and then 
H 2 in collisions with H. Direct formation of H 2 in H-H collisions is unlikely, 
owing to the very low ratio of binding energy to kinetic energy. The standard 
low-redshift mode of H 2 formation by surface chemistry on dust grains is im- 
possible prior to metal enrichment. Once high enough densities are reached 
in cooling primordial clouds, 3-body collisions allow H 2 to form much more 
easily. However, in the absence of dust, the H 2 abundance is very fragile. As 
soon as the first stars form and UV light bathes the surroundings, molecular 
hydrogen will be destroyed throughout the birth halo of the star, prevent- 
ing further cooling and star formation, even in the absence of supernovae to 
disrupt the cloud and disperse metals in stellar winds to modify the IGM 
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chemistry. The UV light is also likely to inhibit cooling in neighbouring halos 
by reducing the H 2 abundance. 

Once some metals have been formed, and some UV radiation has been 
released, the special simple conditions that apply to star formation in the 
early Universe are greatly modified. The process of star formation should 
then quickly become more akin to the complex form recognizable locally. 

The outcome of these high-redshift processes are not yet constrained by 
observations: the study of the earliest stars will surely remain a key area of 
interest for both observational and theoretical astrophysics. One of the excit- 
ing recent developments is the association of high-mass stars with enigmatic 
long-duration GRBs [98]. If this association is genuine, then the extreme lu- 
minosity and unobscured 7 -ray luminosity of GRBs could reveal the deaths 
of some of the earliest high-mass stars. These stars could have exotic proper- 
ties, and masses up to ~ 300 Mq [147]. Finding GRBs at very high redshifts 
may be the best way to investigate early star formation, and to constrain 
the consequences for metal enrichment and subsequent generations of cosmic 
structures. 

4 Cosmology 

We have already mentioned several features of the study of galaxy evolution 
that rely on the background cosmology, the properties of the Universe on the 
largest scales. The details are widely available [82,209,205,184]; however, it 
is useful to have them alongside the following discussion of the high-redshift 
Universe. In previous years it was important to discuss the range of possi- 
ble values of cosmological parameters, but it now seems that the measure 
of the Universe has been taken successfully by combining the Hubble dia- 
gram of Type la supernovae [212] and the angular power spectrum of GMB 
anisotropies [94,173,69]. Other measures, more purely probing the density of 
the Universe, like the abundance of high-redshift clusters and the peculiar 
velocities of low-redshift galaxies are in general agreement with the results 
[64]. 

4.1 The Shape and Size of the Universe 

The first descriptions of the Universe under General Relativity (GR) assumed 
homogeneity and isotropy on large scales, notwithstanding that its total ex- 
tent was still sometimes assumed to be little bigger than the very anisotropic 
Milky Way. Ever since, observations have shown that these ‘cosmological 
principle’ assumptions seem to be reasonable. The deepest, largest redshift 
surveys [83,58] show that the Universe is accurately homogeneous on scales 
greater than 100/i“^Mpc®. Observations of the GMB show that it was ho- 
mogeneous and isotropic at very early times [190,94]; both the GORE-DMR 

® Hubble’s constant Hq = 100 h km s“^ Mpc“^ measures the current expansion rate 
of the Universe. 
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and BOOMERanG CMB images include measurements of order 1000 causally 
disconnected regions. For the currently favoured expansion rate and density 
of the Universe, with h = 0.65 and a density parameter = 0.3, this scale 
corresponds to a mass of order 3 x 10^® Mq, which is reassuringly several times 
greater than the mass of the richest cluster of galaxies. Structures may form 
in the future on larger scales. On smaller scales the topology of the large-scale 
structure traced by galaxies is of overlapping sheets intersecting in filaments, 
which in turn radiate from clusters. This structure can be described accu- 
rately by a self-similar power-law power spectrum as a function of scale. This 
is reasonable, probably reflecting the complexity of the process of feedback 
and non-linear clustering that takes place on small scales during the assem- 
bly of galaxies, and does not necessarily reflect anything special about the 
initial conditions for the Universe. The connection between the amplitude of 
structures in the Universe on different scales, as traced from the small scales 
of galaxies up to clusters, filaments and supercluster sheets, up to the largest 
scales traced by observations of the CMB, can now be described and tested. 

Describing and explaining an isotropic, homogeneous Universe is easier 
than any alternative. It was done early in the development of cosmology, 
almost coincident with the discovery of the expansion by all those listed 
in the title of the Friedmann-Robertson- Walker (FRW) metric. A distance 
element ds in this metric, the most general isotropic, homogeneous space that 
can evolve with time, is described by. 



ds^ = df 



R\t) 



dr^ 
1 — 



-I- (d0^ -I- sin^M^^) 



( 7 ) 



The scalefactor R describes the evolution of the size of the Universe as a 
function of cosmic time t. The spatial coordinate r = (r, 0, <j)) is comoving; 
that is particles moving with the Hubble expansion maintain the same values 
of r. The evolution of the scalefactor is determined by the energy densities of 
different components of the Universe, fc = 1, 0 or —1, and determines the ge- 
ometry/curvature of the Universe. If fc = 1, then the Universe is closed, with 
finite volume and a spherical geometry: distant sections at constant radius 
have a decreasing area, and so it can be possible for light rays to take laps 
around the Universe. If A: = 0, as observations of the CMB strongly imply 
[94], then the Universe has a flat /Euclidean geometry, a steadily increasing 
area of spatial sections and an infinite volume. If fc = —I, then the Universe 
is open, with infinite volume, and distant sections at constant distance have 
exponentially increasing areas. There is no reason why the Universe, if in- 
finite, could not have a much more complex geometry on very large scales: 
observations only confirm that over the region we can see, the FRW descrip- 
tion is an excellent one. Of course, the FRW description fails on small scales 
in the Universe, as the spacetime around the Sun and Milky Way is certainly 
not curved uniformly. Amongst the diverse benefits of having structure on 
small scales is the existence of life and the availability of gravitational lensing 
as a cosmological and cosmographic tool. 
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The effects of different curvatures k can be seen by finding the distance 
along the radial direction (d0 = d4> = 0) out to a comoving coordinate 
distance r at constant time. This ‘proper distance’ Jds = dp is given by the 
product of the value of R at the epoch when it is evaluated (so dt = 0) and 
either arcsinr, r or arsinhr, depending on whether fc = 1, 0, or —1. In the 
k = 1 Universe the proper distance reaches a maximum and then returns 
to zero, and it increases ever more rapidly with r if /c = — 1. Note that the 
proper distance is difficult to determine in practice over a large distance; as it 
is defined at a fixed cosmic time, a pre-ordained conspiracy between observers 
is required to evaluate it. 

Without finding the form of R, it is still possible to see how the properties 
of light traveling through the Universe are affected by the geometry. Light 
travels along geodesics with ds = 0. Along a radial geodesic (d0 = d<() = 0) 
from a fixed comoving radial distance at which a light ray was emitted, re, 
at epoch te to the present time to, r = 0, the integral along a geodesic must 
be the same when travelling with adjacent wavecrests, as r is a comoving 
coordinate. As long as R does not evolve rapidly during the time interval 
that is the inverse of the frequency of the signal, then the integrals of R~^ 
over time, with limits that differ by the time separation of these wavecrests, 
from the metric, must also be the same. Hence, {Rv)~^ must be constant. 
The frequency of radiation received vo thus differs from that emitted r'e by 
the ratio of R{to) and R{te), and so the redshift, historically defined as z = 
(Ae — Ao)/Ae in terms of emitted and received wavelengths Ae and Aq is 
given by 1 -I- 2 = R~^, if R is defined to be unity at the present epoch. 
The link between redshift, distance and time depends on the evolution of the 
scalefactor R. 

4.2 Dynamics of the Universe 

To find out how the scalefactor evolves, and thus determine all the time- 
dependent properties of a FRW Universe, it is necessary to solve the Fried- 
mann equations. 



R^ 
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= ^p + A . 



These are the remains of the Einstein GR field equations after imposing the 
symmetry of the FRW metric, p represents the total mass density of the Uni- 
verse, equivalent to the energy density in both matter and radiation, while p 
is the pressure due to both components. A represents the cosmological con- 
stant, a constant energy density that produces an effective negative pressure. 
This term is free to take any value in GR. It is also possible to make it a 
more general time-dependent function, as in ‘quintessence’ models, that ef- 
fectively add a varying fifth force to the Universe on the very largest scales. 
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Quintessence, or A both affect the Universe on the largest scales, and have 
no observable effects on either solar system or Galactic dynamics. The same 
is true for any curvature term - it is only noticeable on very large scales. 
These factors do however affect the way in which overdensities grow in the 
Universe, a key process in galaxy formation (Sect. 8.2). Note that in an ex- 
panding, flat Universe on the largest scales, or at late times, the equations 
are always dominated by A. 

The equations can be expressed in many different forms, but can be con- 
densed into a single substantive equation, plus the conservation condition 
d(pc^R^) = —pd(R^). p and p can be connected by an equation of state 
p = wpc^. Pressure-free matter (often referred to as dust) has w = 0, while 
radiation has w = 1/3. 

Note that with the exception of the cosmological constant, the Friedmann 
equations can be written down from the equations of conservation of energy 
and motion for a uniform spherical region in Newtonian physics. Since a zero- 
A model has no stable static solution, Newton or any successor could have 
predicted an expanding Universe, rather than just Einstein when he included 
A to stop the inevitable, unexpected expansion^°. 

A very useful form in which to determine R(t) and thus z(t) is to combine 
the Friedmann equations so that all four terms can be expressed as equivalent 
densities that sum to unity, expressed as a time-dependent Hubble parameter, 

ff(t) ~ ^ ~ + 12r + 12 a + 12k] ^ ■ (10) 

I7ni = 87rGpni/3i?^ is the density parameter in ordinary matter, with space 
density Pm, 12^ = SirGp^/SH'^ is derived from the equivalent matter density 
in radiation p^, 12 a = A/3JI^ is the density equivalent to the cosmological 
constant and 12k = —kc^/R^H'^ is the density equivalent to the observed 
(small) curvature of the Universe. These different density terms evolve as 
R~^, and R~^ respectively, and so their relative importance changes 

with epoch. 

This expression can be integrated to obtain R(t) in any specified model. 
Popular solutions over the last eighty years have included Einstein’s unstable 
static solution with A = dnGp, k = 1 and rR = 3Gpc, de Sitter’s exponen- 
tially expanding A-dominated model with k = 0 and R oc expt^/A/3, and the 
Einstein-de Sitter matter-dominated pressureless model (the hot theorists’ 
favourite until 1999), with 12,^ = 1, A = 0 and R oc In the radiation- 
dominated version of this model with Gj. = 1, as expected at early times in all 
monotonically expanding Universes that contain radiation, R oc while 
the oddly-popular zero-density Universe with fc = — 1 and = 1 had R oct. 

That no-one did is perhaps strange - is that well over two hundred years of bland 
scientific conformity or cowed reverence? Maybe the ideas were at large, but not 
voiced clearly, or the idea of discussing the motion of a non-static Universe was 
really too much a leap to make. 
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It is possible for the sum of the density parameters to be greater than unity, 
but fc = — 1 is then required to restore order. 

Current observations [212,94] suggest that f?A — 2l7m, ~ 0 and 

h ~ 0.65. Because of the dependence of the terms on i?, it is inevitable 
(barring a truly cataclysmic phase change when the Universe suddenly ends 
up in a lower energy state than at present) that A will dominate the dynamics 
of the expansion at late times. While a non-zero cosmological constant may 
not be the simplest, neatest form for the Universe, it does ensure that there 
is exponential expansion at late times, and thus a sort of satisfying time- 
independence of the ultimate evolution. The energy density in radiation 12^ 
is already smaller than by a factor of 17000, but dominated the energy 
density when z > 17000. An equivalent to A is also thought to have dominated 
the energy density in the early inflationary epoch, and then converted to 
conventional energy in the phase change that ended inflation and reheated 
the Universe. 



4.3 Horizons 

Whether the Universe has a horizon at early or late times, beyond which 
causal contact cannot be established or maintained depends on whether the 
integral of comoving distance along a geodesic / dt converges. If it does, 
then there is a finite comoving distance beyond which we cannot see, either 
from t = 0 or to t = oo. We seem to live in a Universe with R oc exp(t) at 
both early and late times, and so the integral should converge when taken 
both from 0 — >■ to and from to ^ oo. Hence, there are places in the Universe 
that are currently casually disconnected from us, but which may subsequently 
come into contact, places that are currently in contact but will subsequently 
be moved out of contact, and places that never have and never will be in 
causal contact. At very late times the visible Universe will be made up of 
only the bound local group of galaxies, with all other structures having been 
redshifted beyond sight and influence. It is likely that we would never be able 
to communicate with and receive a signal back from a galaxy at z ~ 2 as the 
Universe is expanding too rapidly for causal contact to be maintained before 
there is time for a reply to be sent^^ [180]. 

4.4 Distance Measures 

The comoving distance D ((3)) to a redshift z is just the comoving radial 
coordinate r in (9), 12 = r. This is the comoving or effective distance [184,205]. 
This measure of distance has the advantage that it does not change with time, 
and that the comoving volume element associated with a shell at radius D has 
the form dU = 47rI2^ dD/y/1 — kD'^, read straight from the FRW metric with 

Alison Farmer’s advice on reviewing this paper at the Journal Club in Caltech 

in November 2001 was “don’t date (in the American sense) at z > 2”. 
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Redshift z 

Fig. 7. The cosmological volume element dV/Az as a function of redshift in three 
different cosmologies. The relative differences between the three volume elements 
is more significant than the differences in the distances shown in Fig. 8. 



no redshift-dependent terms. The factor of (1 + z) in the denominator of (3) 
makes sense in this case, reflecting the competition between the increasing 
photon arrival interval, decreasing photon energy and increased fractional 
bandwidth to receive a signal in unit frequency range in the observer’s frame 
as compared with its emission in the frame of the observed galaxy, all of 
which depend on (1 + z). The volume element associated with unit solid 
angle dV/dz is shown as a function of redshift in Fig. 7. 

Two other measures of distance are often used - the angular diameter 
distance cJa and the luminosity distance dh- The angular diameter distance 
is the distance that describes the angle subtended by a rod of proper length 
I observed at cosmic time te by an observer at cosmic time to- If the rod is 
aligned in the 0 direction on the sky (with dr = d</> = 0), then the rod is 
defined to subtend an angle d0 = l/dA- From the FRW metric, ds = I = 
R{te) r^dO, and so c?a = R{te) fe- Sensibly, this equals the proper distance 
to the rod observed at te in a flat Universe. If fc = —1, then the light paths 
to each end of the rod diverge, increasingly quickly with increasing distance, 
and the proper distance exceeds (Ia- If fc = 1 then light rays to the end of 
the rod diverge decreasingly quickly with increasing distance, and the proper 
distance is less than d^- The redshift dependence in the expression for c?a is 
solely due to the evolution of R since the radiation was emitted. 

The luminosity distance dn is defined in terms of the total energy re- 
ceived from a distant object over all frequencies or in a spectral line. In order 
to calculate the total flux density (W m“^) received from a source with a 
luminosity L (W), light emitted at time tg from coordinate should be con- 
sidered at by the time it arrives at the observer at time to- The light has 
then spread out over a spherical shell with area 47ri?^(to) As a result, the 
flux density S = L/[47ri?^(to) re(l -I- z)^], where the two additional factors of 
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(1 + reflect the reduction in both the rate of photon arrival and energy 
of each photon due to redshift 1 + z = R{to) /R{te)- dn can then be identified 
in S' = L/{4TTd1), and so dn = d?(to)fe(l + z) = (1 + z)‘^dA, differing from 
and exceeding dA, once a further 1 + z factor is included from scaling R{to) 
from R{te). 

There are three practical definitions of distance, the effective distance D, 
which transforms spectral luminosity and the volume element most natu- 
rally, the angular diameter distance dA, which transforms the angular size of 
objects on the sky, and the luminosity distance du, which transforms either 
monochromatic or bolometric luminosity. The relative redshift dependences 
of dA and dj^ are compared in Fig. 8. Angular diameter distances are always 
less than luminosity distances. This has the consequence that high-redshift 
objects become too faint to detect more rapidly than they become impossible 
to resolve. Intuitively, this can be thought of in terms of the ever increasing 
redshift sapping the energy of the source, while the finite distance to the edge 
of the observable Universe means that once a source is very far away, it is not 
easy to make it appear appreciably more distant by moving it farther into 
the background. 





Fig. 8. The differences between luminosity and angular diameter distances for three 
historically reasonable valnes of the cosmological parameters. The flat, non-zero 
cosmological constant model represented by the solid line is now much favoured by 
observations. Note that the luminosity distance always increases, whereas the angn- 
lar diameter distance increases and then declines. This is loosely due to the emission 
of photons at early time by objects that were then closer, and so ‘looming over’ us 
on the sky. This explains why CMB signals that started their journey towards us 
from points separated by only 300,000 lightyears on the sky 1.5 x 10^° years ago and 
have therefore traveled about 5 x 10^^ times further than their original separation 
are separated by the relatively large angle of 1 deg at the present epoch. 
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Surface Brightness Dimming: A consequence of the different depen- 
dences of angular diameter and luminosity distance on redshift is that the sur- 
face brightness of a source at redshift z, which depends on d\/d^, decreases 
rapidly with increasing redshift as (l-|-z)“^: ‘cosmological surface brightness 
dimming’. This can be especially significant when observing galaxies against 
a sky background due to the atmosphere or to blended emission from more 
nearby sources, and can possibly account for significant misinterpretations of 
the appearance of distant galaxies. If only the brightest parts of the galaxy 
can be detected, then apparently fragmented, multiple structure in a galaxy 
could simply be a biased view of the brightest regions of a larger underly- 
ing galaxy. Surface brightness selection effects are not currently important 
at mid-IR and longer wavelengths, as the resolution of telescopes operating 
at these wavelengths are insufficient to resolve distant galaxies: their sur- 
face brightness distribution is diluted by a point spread function much larger 
than the intrinsic angular size of the galaxy. This confers no advantage over a 
properly resolved image! In future continuum observations of resolved galax- 
ies using ALMA, the peak surface brightness in an image could perhaps be 
used as a distance indicator: a detectable galaxy at redshift z = 10 would 
have to have an exceptionally bright region, with a surface brightness over 600 
times greater, in order to equal the observed surface brightness of a galaxy 
at z = 2. Any galaxies with bright knots of emission would then have to be 
at relatively low redshifts. 

4.5 Growth of Perturbations 

Once the form of evolution of the scalefactor R is determined, then the back- 
drop of the evolving Universe is set, and it is possible to investigate the 
evolution of its contents in more detail. In particular, a description of the 
growth of overdensities is required to explain the growth of structure from 
initial smoothness towards the overdensities by factors of 10^“^ that exist now 
between the scales of galaxies and the IGM. At recombination, the CMB re- 
veals density fluctuations in baryonic matter at the few 10“® level, although 
the overdensities in CDM, being immune to radiation pressure, should be 
greater. 

This can be done by a parallel analysis to the Jeans’ analysis for star- 
forming clouds in the ISM, by linearizing the Poisson equation for the gravi- 
tational potential, the continuity equation for conservation of mass, and the 
Euler equation for fluid motion, in comoving coordinates locked within the 
expanding Universe. The analysis [82] leads to an equation for the evolu- 
tion of the overdensity of a patch of the Universe with a mean density p, 
S = {p — p) / p oi the form, 

J -I- 2H{t)6 — 4:TTGp(t)6 = 0 . 



( 11 ) 
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The equation involves the Hubble parameter, which includes the dynamics of 
the Universe. Note also that p{t) has a strong R~^ dependence in a matter- 
dominated model. 

In a static situation, H{t) = 0 and the 5 term vanishes, returning the 
familiar solution of an exponentially growing mode, from the star-formation 
Jeans’ analysis. If this was the case, and there were no complications from 
magnetic fields, as arise in Galactic situations, then there would not be a 
problem to develop the rich structures we observe at the present epoch from 
almost smooth initial conditions at z ~ 1100. However, in an expanding Uni- 
verse, the first-order term does not vanish, and acts to reduce the speed of 
growth. In the case of a matter-dominated Universe, which is a good ap- 
proximation at z ~ 10-1000 where the growth of perturbations starts, then 
H oc and so the results are a growing mode with <5 oc (or [1 -|- z]“^) 
and a decaying mode with S oc A power-law growing mode is quite a 
challenge to structure formation - from recombination at z ~ 1100, even a 
very rare 10“^ fluctuation will reach only J ~ 0.1 by the present epoch. As 
non-linear growth is expected to set in at J ~ 1, a recombination overdensity 
of order 10“^ would thus be required to meet this condition by the present 
epoch. Such overdensities are extremely rare, from the CMB fluctuation spec- 
trum with an amplitude of several 10“® [94]. The situation can be rescued if 
there is a dominant dark-matter field, in which fluctuations can grow with- 
out needing to wait for photon pressure to vanish at recombination. They 
can in fact start growing as soon as the Universe becomes matter dominated 
at z ~ 17000 [193]. This allows a very typical 10“® dark-matter perturbation 
to reach a significant overdensity S ~ 0.17 at the present epoch, and for a 
not unusual 10“^ fluctuation to reach J ~ 2, assuming linear growth. 

The cosmological constant does not yet completely dominate the dynam- 
ics of our Universe, but will do several Hubble times into the future. When the 
cosmological constant dominates the dynamics, then the growth of structure 
is powerfully suppressed and perturbations do not grow: H is then constant, 
not falling, and so the 5 term remains large, while the driving density falls 
rapidly due to the exponential growth of R. Hence, the Universe we see to- 
day is unlikely to develop a much richer array of structure as it enters a 
A-dominated phase. In an unfavoured open, low-density model Universe the 
growth of perturbations can continue at a low level, but structure formation is 
retarded at z < leading to the formation of massive structures at earlier 
times than in denser models. The abundance of clusters at z > 0.5 supports 
a low-density model: if = 1, then it is difficult to generate sufficient early 
growth of fluctuations to lead to the observed abundance of clusters, without 
too much structure in erg, the width of the distribution of the density contrast 
on Mpe scales at the present epoch. 




42 



Andrew W. Blain 



Carroll et al. [71] provided a very useful and accurate analytic fitting 
function to compare the growth of fluctuations in different cosmologies, 



<^(^ = 0 ) 5 

<5Eds(2: = 0) 2 



+ 





(12) 



in which JecIS oc is the overdensity in an Einstein-de Sitter model. The 
overdensity at the present epoch is expected to be about 65% of the value in 
an Einstein-de Sitter model for Ha = 0.7 and = 0.3 and the same value 
of h. The results of some exact calculations for S{z) are shown in Fig. 9. 

The growth of small CDM perturbations in the early Universe generates 
potential wells into which baryonic matter can collapse and cool, beginning 
the process of galaxy and star formation. The scale and rate at which it takes 
place are controlled by the details of the astrophysics in each halo, and by the 
properties of the initial density field. For example, if there is a preferred scale 
on which fluctuations are unusually large, then this could be represented in 
the observed distribution of galaxies. Note that the power-law faint-end slope 
of the Schechter luminosity function implies that either this is not the case, or 
the structure was subsequently washed out as galaxy formation continued. No 
trace of a strongly preferred scale appears to remain in the galaxy correlation 




Redshift 

Fig. 9. The form of growth of perturbations in popular Universal models as com- 
pared with the Einstein-de Sitter model {Oq — 1; l7/i = 0). The value of 5 is 
important as it controls the linear growth of overdense structnres, and the redshifts 
at which structnres greater than a certain mass appear. In the Einstein-de Sitter 
model 5 oc (1-1-2:)“^ = R. In lower density models the grow of perturbations is 
retarded at later times, but only by a small factor. Identical normalisation at early 
times is reasonable, as all models are matter-dominated at early times. The flatten- 
ing of the curves at low redshifts reflects the small interval of cosmic time available 
there in the plot. Note that the growth rates are not converging to the same value 
at the present epoch. 
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function [83]. The angular scale corresponding to the first peak in the CMB 
power spectrum is on supercluster scales, and so does not appear to have had 
a significant influence on galaxy properties. 



4.6 The Growth and Collapse of Individual Halos 

The Jeans’ analysis discussed above provides a way to assess the growth 
of small statistical fluctuations. However, it provides no insight into how to 
track the growth of an individual overdense halo. For this study it is essential 
to track density contrasts as they grow non-linearly to J ~ 1 and greater, 
corresponding to the formation of bound galactic halos. This can be done 
in an Einstein-de Sitter model (a reasonable approximation at early times) 
by tracking the evolution of individual overdense regions in the Universe, 
described as closed {k = 1) sub-Uni verses with their own solutions to the 
Friedmann equations. The radial evolution of ‘successful’ perturbations that 
collapse from the Hubble fiow and form bound objects has a cycloid solution 
by analogy. Expansion occurs to a maximum extent - ‘turnaround’ - before 
recollapse towards a point, following the solution for radius r at time t: 

r = H (1 — COS0), t = B{0 — sin9). (13) 



In order to ensure that a mass M is enclosed, and the acceleration f = 
—GM/r"^ is correct at the edge of the sub-region, = GMB^. By expanding 
both r and t as functions of 0, for small t, the 9 dependence can be eliminated, 
and 




(14) 



is found. Note that to first order the linear size of the region grows in the same 
way as the scalefactor in an Einstein-de Sitter model, while the density 
contrast increases as 3 times the first-order term in the bracket, and so also 
depends on for small t, as expected from the Jeans’ analysis. Differences 
arise when the density contrast becomes greater. Turnaround occurs when 
9 = TT (and t = Bit). At that point, r = 2 A, and by comparing the volume 
enclosed within this radius with the size to which the region would have 
expanded if J = 0, the over density at turnaround 



Jta 



1 

(W 





(15) 



That is, in an Einstein-de Sitter model, the density of a halo is about 5.6 
times greater than the average density of the Universe when turnaround 
takes place. If the overdensity had grown linearly, then the density contrast 
at turnaround would be (3/20)(6t/H)^/^, only ~ 1.06 at the corresponding 
epoch. Hence, non-linear growth is important by the time of turnaround, as 
Sp/p > 1. Note that without considering non-linear growth, the overdensity 
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of the collapsing region would be underestimated severely. Note, however, 
that this enhanced growth factor is not sufficient to remove the need for dark 
matter to drive early fluctuation growth. 

What happens as overdensities collapse? At some point shortly before 
or after turnaround the linear solution will fail, and an overdensity will 
break away from the cycloid solution. Being too dense to be affected by 
the surrounding Universe, the collapse will then proceed more rapidly. The 
gas and dark matter within will orbit and scatter, and should settle down 
into virial equilibrium - distributing the potential energy liberated during 
collapse, Vta — W into kinetic energy T. As the condition T = —VyI2 holds 
in virial equilibrium, and Vta ~ Vy = T, this requires Vta = W/2, and 
so a contraction by a factor of 2 from turnaround to the final state. In this 
way, the object becomes self-supporting at half of the turnaround radius, and 
never contracts to a point. In the cycloid solution, the value of r reaches half 
of its value at turnaround when 9 = 37t/2, which could be associated with 
the time of virialization. The corresponding fractional overdensity is 146.8. 
If 9 is taken all the way to 27 t, at t = 2irB, then the linear growth solution 
predicts S ~ (3/20)(127t)^/^ ~ 1.69. This condition d = 1.69 is often used as 
a threshold to define the collapse of structures in linear theory. 

Hence, bound galactic dark-matter haloes would be expected to be about 
150 times denser than the background Universe at the time they became 
bound/formed. At later times, the background Universe continues to expand 
and become less dense, but the size/density of the bound object remains fixed. 
There may be some minor reduction in size due to losses of energy in radiation 
and the ejection of angular momentum, but the density of dark-matter objects 
could thus provide an indication of the redshift of formation Z{. For example, 
the present density of the Universe is about 3.5 x 10^° Mq Mpc“^, but within 
a radius of 1 Mpc, rich clusters of galaxies are approximately 10"^ times denser. 
Hence, the Universe would be expected to have expanded by a factor (1 -V 
Zf)^ ~ 10^/150 since clusters virialized, implying Zf ~ 3. That Zf is remotely 
similar to a plausible formation redshift for clusters suggests this is plausible. 
A value of Zf = 3 actually makes more sense as the redshift at which the 
cluster member galaxies formed. In N-body simulations, it is conventional for 
bound objects resembling rich clusters in mass and extent to form at z < 1. 

This treatment refers to the progress of the collapse of an overdense region 
of the Universe composed predominantly of dark matter. As described earlier, 
the complexity of describing the cooling baryonic material within and the 
onset of star formation and feedback is considerably greater. Simulations of 
these processes cannot currently claim predictive power. It is possible that 
we may always have to rely on observations to understand these early phases 
of galaxy evolution. 
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4.7 Press Schechter Formalism and Extensions 

One approach to studying structure formation and galaxy formation is to 
make N-body simulations of the gravitational interaction of dark-matter par- 
ticles, which if convergence is obtained, should provide an excellent guide 
to the gravitational development of structure. This allows the properties of 
individual galaxy dark-matter haloes to be related to the initial conditions 
in the Universe set in the simulation. 

A very neat way to combine the form of growth of perturbations and 
the initial fluctuation spectrum to generate a mass spectrum of bound ob- 
jects without numerical methods was developed in 1974 by Press & Schechter 
[216]. The result is analytic and includes in a relatively transparent way the 
length- and time-scales that are important in determining the properties of 
the galaxy population. The formalism has been extended [61,59] to account 
for the abundance of collapsed bound halos that are part of larger structures, 
which it is necessary to treat when considering galaxies within clusters. The 
formalism has also been adapted [237] to make a better fit to the results 
of high-resolution numerical simulations [159], by taking into account that 
halos are typically non-spherical, and so are expected to collapse along their 
shortest axis first. This factor was neglected above: at the level of our under- 
standing of galaxy formation is probably not the worst assumption made. 

There are two parts to the Press-Schechter description. First, tracking the 
growth of the perturbations that eventually collapse to form galaxies, which 
determines when an overdense region turns around and starts to collapse. 
Second, the way in which the amplitude of fluctuations depends on the initial 
conditions imposed on the density field. 

We investigate the consequences of the Press-Schechter description for 
the rate of mergers between galaxies, which are likely to be the biggest factor 
controlling the evolution of galaxies at far-IR wavelengths, as there is a clear 
association between merging objects and the most luminous dust-enshrouded 
galaxies [226]. 



Scales of Overdensities: An overdensity field S(x) can be expressed as a 
sum of Fourier modes in a large volume with periodic boundary conditions 
as J^Skexp(-ik.x). When integrated over a representative volume V, 



/ S(x) exp(-ik'.x) d^x = 

Jv Jv 



5k exp[— z(fe' — k).x] d^a: = V5k' 



(16) 



If the variance in the density field is defined as a, then 

k k 

where 6k is a scalar, whose properties can be described by a one-dimensional 
power spectrum in Fourier space P{k) = 5'^, as U — >■ oo and the universe 
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becomes fully homogeneous and isotropic. The variance in the density field 
can be represented by, 



a^V = aHTTl^dl = 



(27t)- 



J P{k) dirk'^ dk . 



(18) 



It is desirable to exclude contributions from fluctuations on scales greater 
than a cutoff radius, which defines the mass associated with a fluctuation of 
interest, and so it is better to define the spread in the overdensity spectrum 

by, 




P{k)W{k) k^dk , 



(19) 



including a window function W (k) to sample the fluctuation spectrum on 
a chosen range of scales. A Gaussian window function Wr oc exp(— (r/i?)^) 
transforming to Wk oc exp(— (fci?)^) has the advantage of no sharp features in 
either real or Fourier space. Note that will be convergent if P{k) oc fc” with 
n > — 3 on large scales (fc — >■ 0) and n < — 3 on small scales {k — >■ oo) without 
a window function. Hence, the fluctuation power spectrum P{k) must curve 
to avoid inconsistency. 

n = 1 is predicted by inflationary theories of seed structures that grow 
from quantum fluctuations as the Universe expands exponentially on all 
scales. This is a ‘scale-independent’ or Harrison-Zeldovich spectrum, for 
which the typical amplitude of fluctuations entering the horizon is constant 
in time, or equivalently, for which the depth of potential wells associated with 
fluctuations on different spatial scales is always the same. It is sufficient to il- 
lustrate using an Einstein-de Sitter model, because at early times after recom- 
bination the Universe is close to being matter-dominated. On a chosen scale, 
(T^ oc where M is the average mass enclosed within that scale. 

Fluctuations, and thus a, grow as and so cr oc On the 

horizon scale, the enclosed mass is the product of the density, which depends 
on R~^{(x and the cube of the horizon scale (oc Hence, the mass en- 
closed within the horizon M (x t, and within the horizon a oc t(2/3)-(3-i-n)/6^ 
Thus, for a to be independent of epoch, n = 1. Causal processes can only 
operate on scales smaller than the horizon, and so the amplitude of fluctua- 
tions that can start to grow and begin to affect the galaxy formation process 
is constant in this situation. 

Once the process of galaxy formation gets underway, non-linear collapse 
modifies the spectrum of fluctuations [204]. The two-point correlation func- 
tion of galaxy positions on the sky can be transformed to yield the dependence 
P{k) oc k on small scales. Only on the largest scales, corresponding to 
both CMB fluctuations over 10-deg in extent on the sky, and spatial scales 
greater than the largest superclusters is the spectrum close to scale-invariant. 
All-sky and wide-area CMB observations confirm that n is within about 5% 
of unity on these scales. The all-sky CMB results from WMAP provide a 
very accurate measurement of n on the largest scales. 
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Components in the Press— Schechter Formalism: If the expected spread 
in the magnitude of the fluctuations on a scale R, involving a typical mass 
M, are described by cr, and if the fluctuation spectrum is Gaussian, then 
the probability of finding a region of mass M with overdensity 5 exceeding a 
certain value i5c is 



F = 




1 

t) 



exp 



52 1 

2cr2(M, t)_ 



dd , 



(20) 



which can be re-written as a mass spectrum, 

Nps(M)dM = - — (21) 

a depends on M via the links to the power spectrum P{k) oc fc”, with a oc 
where 7 = -(3-1- n)/3, and its time dependence is determined by the 
evolution of S. 

The criterion usually applied for the formation of a bound object is that 
an overdensity turns around from the Hubble flow and goes on to collapse 
to a point in a time-symmetric way by epoch t if the overdensity exceeds 
the critical value i5c — 1.69 for linear growth that matches the whole collapse 
cycle. As the mean size of fluctuations grows steadily with cosmic time, this 
corresponds to a greater number of bound objects exceeding the Sc barrier at 
later times. More sophisticated descriptions of the process in careful, rigor- 
ous statistical terms, involving the trajectories of random walks through the 
density held, can be found elsewhere [20,237]. Care is necessary to normalise 
the Press-Schechter function correctly - simple treatments only recover half 
of the total density in the final function. The reason for this is still rather 
nebulous, but is linked loosely to the lack of treatment of underdense voids in 
the description, and tends to be explained away by the claim that a fraction 
of material accretes gradually into halos after their initial collapse. However, 
the agreement between the predictions and the results of high-resolution N- 
body simulations [159] is good, especially after modifications have been made 
to the empirical/collapse condition [237]. The Press-Schechter function can 
be used in variables Sc and in which case 



Nps{M,t) 



p I Sc da / \ 



(22) 



however, a more useful description [112] is to convert to the mass of bound 
objects as the main variable and define M*{t) to represent the typical mass 
of a bound object as a a function of time. In the hierarchical picture that 
the formalism represents, M* increases monotonically with time. When the 
mass description is used. 




iVps(M)=7^^ 



(23) 
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This form of this function is related directly to the mass of bound objects, and 
can thus be converted into a luminosity function via a mass-to-light ratio, 
or moments can be determined to study the cross section to gravitational 
lensing, for example. 

The empirical modifications to the Press-Schechter formalism to account 
for the non-spherical collapse of haloes [237] are described by. 



iVsT = A\fa 



1 + 




(24) 



where A ~ 0.32, a ~ 0.7 and p ~ 0.28 [237,211]. For the standard Press- 
Schechter function A = 1 and p = 0. The difference from the Press-Schechter 
function is most pronounced at M* and below. 

It is necessary that both J N dM = p and jMNdM = 0 to ensure 
that the total mass is normalized correctly, and that mass is conserved as 
the mass function evolves. It is possible to restrict the lower limit of these 
integrals to bound the possible values of dark-matter halo mass, and there 
could be gradual growth of bound objects from an almost homogeneous IGM 
component with time rather than all being derived from the merger of pre- 
existing bound objects. 

A simple merger rate of galaxies can be derived, to provide a neat account 
of the rate and properties of the major mergers between galaxies with similar 
masses that likely account for most of the activity seen in dusty galaxies. The 
merger rate can be determined from fVps. At low masses, galaxies continually 
merge from a chosen mass to form bound objects at greater masses, and while 
less massive objects continually combine to form new haloes at the chosen 
mass. The slope of A^ps at low masses should not change with epoch, as on 
sub-cluster scales that are safely above any minimum halo mass, and below 
the evolving value of M* the power-law distribution of masses of nearby halos 
should lead to self-similar, complex, random merger behaviour. To conserve 
mass, the value of fVps must be negative at low masses, and positive at high. 
This reflects the hierarchical progression of structure formation, sweeping up 
from low to high masses with cosmic epoch. 

Including the Sheth-Tormen modification, there is no requirement for a 
low-mass cutoff X in the density normalisation integral, as 

/ MNst dM = ^ / C/-1/2 ( 1 -k -C/-P ) exp(-C/) dU . (25) 

Jx v'^ Jx V 2 y 

For both the Sheth-Tormen description (unless p > 0.5), and for A = 1 and 
p = 0 (standard Press-Schechter), this integral converges to p for X = 0. 
The value of Nps or iVsT at small masses (M <C M*) must also be negative: 
this requires the same constraint on p. 

Some form of empirical correction is then required at low masses to obtain 
a ‘formation’ rate, which must at least be zero for all masses. The slope of 
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A^ST and TVst at low masses is the same, and so 

-^form = -^ST + </'A^ST ( 26 ) 

is a good choice at these low masses. At large M ^ M*, N ^ N, and the 
straight derivative Ast provides a good description of the formation rate 
[48,49]. An additional term or two to ensure good behaviour at M ~ M*, 
where most activity occurs is also necessary. However, the details are not 
important here. The most significant part of the treatment is to overcome 
the negative Aps at small masses, as mergers must still be taking place on 
those scales, at least within a few orders of magnitude of M*. 

To find the total amount of material involved in mergers as a function of 
time, integrate MAform over mass. The result depends only on M* /M* , and 
thus on the rate of fluctuation growth, and explicitly not on the value of p 
or A. Hence, the rate at which mass is processed through merging galaxies 
depends only on S(t). It is then a question of determining the efficiency of 
each merger in forming stars or fueling an AGN, which could depend on both 
mass and cosmic epoch, and have a complex internal time evolution, in order 
to relate the results of observations. 

Building an evolving mass spectrum for galaxies based on both the grav- 
itational growth of dark matter fluctuations in specific cosmological models, 
and the initial conditions describing these fluctuations provides a neat, ana- 
lytic way to investigate the formation of galaxies. It is in the transformation 
from mass to luminosity that most of the complexity and uncertain astro- 
physics lurks. 

4.8 Angular Momentum of Galaxies 

Most stars in the Universe are in pressure-supported Galactic bulges [130], 
but rotationally supported spiral galaxies are most numerous at the present 
epoch. Their angular momentum was picked up somewhere along their for- 
mation path, and has not been erased by merging activity. As well as the 
mass distribution of the dark-matter haloes that comprise the underlying 
building blocks for the galaxy- formation process, the issue of the distribu- 
tion of galaxy spins is important, especially as many of the most luminous 
low-redshift dusty galaxies appear to be merging. 

The key process that generates angular momentum is tidal torquing of 
non-spherical halos against each other during the linear phase of growth of 
fluctuations. Analysing the numerical halos formed in N-body simulations 
tends to indicate a log-normal distribution of the dimensionless angular mo- 
mentum parameter A = J| where J is angular momentum, 
M is mass, and E is the binding energy of a virialized halo. For a body 
with size r and typical internal velocity u, J ~ Mrv, E ^ Mv'^, and so 
A ~ rv^ /GM . If it is rotationally supported, then ^ GM/r, and so A ~ 1. 
For the Milky Way, r ~ 8kpc and v ~ 220kms“^ enclosing M ~ 10^^ Mq, 
and so A ~ 0.7. 
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Tidal torquing takes place as overdensities grow. If two adjacent over- 
densities with density contrast 6 and mass M are separated by a comoving 
distance r, then their mutual acceleration a ~ GM5 , where R is the 
scalefactor. Only the overdensity unbalanced by other coordinating clouds 
leads to an acceleration. If the overdensities are asymmetric, then a torque 
can act with a lever arm I < rR/2, less than half the separation of the clouds. 
The torque T ~ aMl = GM'^S/2Rr. Note that as fluctuations grow as S (x R, 
matching the growth of the Universe, the torque is constant. After turnaround 
at txA) torque from the asymmetry becomes less effective, due to the lever 
arm vanishing as the halo collapses. Angular momentum should thus build up 
uniformly from very early times until turnaround, when it reaches J = TtxAi 
and E ~ GM"^ /tRta- Recall that the overdensity at turnaround is expected 
to be 97 t^/ 16. By expressing the time at turnaround txA — to{RTA/ 
and M ~ po(47r/3)(ri?o)^) the value of A ~ to^Gpo, or about 0.3 [205]. 

This means that galaxies are not too far from being rotationally sup- 
ported, whatever their formation epoch, and motivates the idea that the 
earliest objects are likely to be disks rather than monolithically collapsing 
slabs or low-angular momentum spheres of gas, which could exist but are 
probably infrequent. Thus perhaps a picture of the formation of elliptical 
galaxies as mergers of ensembles of small rotationally supported disks is the 
most likely scenario for galaxy formation, as included in the recipes for most 
semi-analytic models. However, these models tend to fail to reproduce a suf- 
ficiently large number of very luminous galaxies, which may form stars in a 
more efficient mode that consumes more of their gas content more rapidly. 
Direct observational untangling of these issues is a key goal for future ob- 
servations of high-redshift galaxies, and these questions have certainly not 
yet been settled. These considerations also apply only to dark matter. Once 
collapse begins, the angular momentum of the baryons and dark matter can 
evolve differently. 

4.9 Correlations and Peculiar Velocities 

Another fossil record of the galaxy formation process is the clustering strength 
of galaxies, which seems to be a Universal power law, with the probability of 
finding a galaxy a distance r from a chosen object varying as ^ ~ 
with 7 ~ 1.8 and with tq — 5 — 12h~^ Mpc depending on the type of galaxy. 
The 2-point correlation function, the Fourier transform of the power spectrum 
of fluctuations P{k), is being measured in unprecedented detail by the 2dF 
and SDSS projects [210,256]. Higher-order correlations between galaxy posi- 
tions carry information about non-Gaussian structure present either at the 
beginning of (not expected in simple inflationary models) or imposed during 
the process of galaxy formation. Peculiar velocities induced by overdensities 
also offer a complementary approach to probing large scale structure. 
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4.10 Cosmology and Studies of Galaxy Evolution 

The details of the galaxy formation process are likely to be dominated by 
the internal astrophysics of galaxies, strongly affected by environment, and 
properties like metallicity that are determined by the previous history of the 
material in the galaxy. The easiest effect of the background cosmology to 
quantify is on the appearance of a galaxy as a function of redshift, via the 
interaction of the shape of the redshifted SED of a galaxy and the shape of 
the Universe. This provides a distinct advantage when investigating galaxy 
evolution via emission for cool galaxies at submillimetre and far-IR wave- 
lengths. 

The counts and background radiation intensity are affected by the values 
of the cosmological parameters, via r(z) for the background ((4)), and also 
by the distance parameter D{z) for the counts ((2)). These parameters are 
probably now reasonably well-determined, and over the next few years, they 
should be fixed to better than 1% accuracy by precise measurements of the 
CMB. 

4.11 K Correctious 

The detectability of distant galaxies depends on several factors. The sensi- 
tivity of a particular instrument often imposes a limit to how many distant 
galaxies can be studied, and only if the instrument can search a wide enough 
area to detect a high-redshift galaxy in a practical integration time can cos- 
mological observations be carried out. For example, in the 1960’s only the very 
brightest of distant QSOs could be detected using photographic techniques, 
even on the 5-m Hale telescope. The introduction of electronic detectors since 
the 1970’s has allowed quantitative high-redshift astronomy to develop. 

The brightness of foreground emission from the telescope, atmosphere. 
Zodiacal light and low-redshift objects can also be difficult to handle. The 
relative importance of these effects differs from waveband to waveband. In 
general, the finer the angular resolution, the less important the foreground 
signal. Cosmological surface brightness dimming could easily render all or 
part of more diffuse high-redshift objects undetectable, and could account 
for part of the unusual morphologies suggested for galaxies in the deepest 
HST observations. The overlapping signals from faint galaxies clustered on 
the scale of the observing beam [52,150,206], source confusion noise, now 
limits the depth of submillimetre-wave surveys using 10-m class telescopes, 
as it once limited deep radio and X-ray observations. Once all these issues 
are understood, the intrinsic brightness of the galaxies can be considered. 

Depending on the form of the SED, galaxies at different redshifts can be 
relatively easier to detect at a particular wavelength. Investigations of the 
distant Universe at radio and optical wavelengths probe relatively flat SEDs, 
with fi, oc with a ~ — 1, and so distant galaxies become steadily harder 
to detect, both due to inverse square law dimming, and to redshift reducing 
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the fraction of the total energy that arrives in the observing band. This 
continues to affect the deepest optical and radio observations [222]. In Fig. 10 
we show the steadily reducing flux density expected from a standard, but 
increasingly distant galaxy in these bands. Surveys at near-IR wavelengths 
could potentially probe the stellar population of more distant galaxies out to 
high redshifts, at which absorption in the Lyman-a transition in intervening 
neutral matter is redshifted into the observing band. They also detect very 
red galaxies, with steeper spectra, /j, oc v ^ ® (the Extremely Red Objects 
‘EROs’ with R — K > 5.5). When redshifted, this steep spectrum makes a 
galaxy rapidly more difficult to detect. 




Redshift z 

Fig. 10. An illustration of the K corrections expected for a dusty star-forming 
galaxy from realistic to very high redshifts. The low luminosity chosen perhaps 
corresponds to a distant ‘first-light’ object. There are several key features. First, 
the dramatic inverted (sub) millimeter K correction due to redshifting the steep 
dust spectrum, in comparison with the undramatic monotonic decline of the 1.4- 
GHz radio flux density. Secondly, the peak of the flux density-redshift curves occurs 
at a constant (high) redshift for frequencies between about 30 & 300 GHz. This is 
because a minimum dust temperature is imposed by the CMB, 2.73(1 -I- z) K; and 
this controls the redshift at which the peak of the SED appears in the observing 
band. Thirdly, the detectability of the near-IR (5 /rm) and radio emission from very 
distant objects. As the SED of starlight in the mid-IR is almost a blackbody, the 
5 — /rm emission experiences a significant K correction for 2 : ~ 3 — 30, and is cutoff 
at the redshift above which Lyman-a absorption is redshifted into the observing 
band: about 2 ~ 38 at 5/rm. The thin dashed line representing radio emission 
includes synchrotron emission, but not free-free emission, which has an almost flat 
spectrum and dominates at the highest redshifts. The spectrum including free-free 
emission is shown by the thick dashed line. As the Universe is optically thin to radio 
emission, the free-free emission from ionized gas in the very first galaxies at 2 ~ 20 
could be detected using a future Square Kilometer Array (SKA) radio telescope, if 
the regions are sufficiently compact and the resolution is sufficiently great to avoid 
source confusion. 
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For historical reasons the effect of redshifting the spectrum is described 
as a, K correction. At small redshifts, the correction from redshifting the 
SED of the target population could be added as a minor correction term to 
a magnitude-redshift diagram, in addition to the small ‘E-correction’ from 
galaxy evolution. There are three especially interesting spectral regions for 
the iF-correction. One is in the (sub)millimetre waveband, where redshifting 
the steep fi, oc SED can lead to distant galaxies with the same bolomet- 
ric luminosity becoming brighter with increasing redshift (Fig. 10). Another is 
the rest-frame mid-IR waveband for galaxies that contain little hot dust, but 
have an evolved stellar population. The Rayleigh- Jeans thermal tail of stellar 
photospheric emission with f^, oc can then be probed. The third is at soft 
X-ray energies, of order 1 — 5 keV, where Compton-thick AGN surrounded by 
column densities of about 10^"^ cm“^ of gas can have the X-ray emission from 
their innermost regions absorbed, leading to an extremely blue X-ray spec- 
trum at frequencies less than the peak of the X-ray background radiation at 
about 30keV. This is closely analogous to the submillimetre situation, with 
the caveat that there are a large number of much redder foreground X-ray 
sources to weed out before the most obscured distant objects can be detected. 

Optical and Radio K Corrections: The effects of different K corrections 
are illustrated in Fig. 10 at a range of frequencies, extending out to potentially 
extremely high redshifts. The appearance of an 0-star photospheric SED in 
the near-IR waveband is shown for comparison. The K corrections for this 
source at 2 : < 1, and for a 1.4-GHz radio source are also shown. Both display 
a steep monotonic decline with increasing redshift. Similar declines in the 
detectability of galaxies are typical in the mid- and far-IR observing bands. 
The effects are illustrated for a very low luminosity galaxy, that might be 
typical of the very earliest luminous objects, and is a factor 10^“"^ times less 
luminous than the most exotic QSOs and ultraluminous dusty galaxies known 
at high redshifts [226]. 



Selection Effects at Long Wavelengths and Photometric Redshifts: 

The Universe is transparent to radiation at mid-IR and longer wavelengths. 
Beyond 300 /xm, the A'-correction effect of redshifting the spectrum renders 
very distant galaxies almost uniquely easy to detect (Fig. 10). However, these 
observations are not immune to selection effects. In particular, it is diffi- 
cult to distinguish a high-redshift galaxy from one with a cooler SED at 
lower redshift [109,45,57]. The effect is illustrated in Fig. 11 at wavelengths 
of 1.1mm, where it is very significant, and at 450 /xm and 175 /xm where it 
is much less dramatic. It is important to be aware of this effect when in- 
terpreting the results of far-IR and submillimetre-wave galaxy surveys and 
high-redshift observations of known galaxies, which can be very sensitive to 
their assumed SEDs. Based on long- wavelength continuum observations alone 
it seems to be impossible to avoid this temperature-redshift degeneracy [53] . 
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Fig. 11. Some of the uncertainties that affect the interpretation of the results of 
submillimetre and far-IR surveys. The left panel shows the effects of changing the 
cosmological parameters on the K correction and the volume of a unit-redshift 
shell in the Universe. While the (0.3, 0.7) model almost certainly represents the real 
Universe, the other models can be seen to produce rather similar effects. The right 
panel shows the much more significant effect of different dust temperatures/SEDs. 
Low-temperature galaxies with a fixed total luminosity are significantly easier to 
detect at all redshifts at submillimetre wavelengths, and at redshifts 2 ; < 1 at far-IR 
wavelengths (175 /im). At shorter wavelengths this effect is much less significant, as 
the SED is observed close to the peak restframe wavelength. The effects of modifying 
a and /?, the other SED parameters are much less significant [57] . The temperature 
dependence is sufficiently strong that the quantity T/(l -\- z) is constrained much 
better by colour data than T or z independently [45,57,53]. 



In the light of SIRTF observations, it may turn out that there is a clear 
luminosity-temperature relation which will allow redshifts to be estimated 
using a colour-magnitude diagram. However, the existence of warm, low- 
luminosity M82 and cool, high-luminosity ISO galaxies [76] tends to argue 
against a simple reliable application of such a technique to individual galax- 
ies. The range of possible SEDs, as a function of luminosity for galaxies with 
well-defined redshifts, multi-colour far-IR and radio data, assuming that the 
far-infrared-radio correlation holds is shown in Fig. 12. The significant mini- 
mum 25% scatter sets a limit to the maximum possible accuracy of a redshift 
determined using a multi-colour photometric technique [151]. 



5 The Cosmic Microwave Background (CMB) 

The CMB dominates the energy density in radiation in the Universe, exceed- 
ing the cumulative background radiation intensity from galaxies integrated 
over all wavelengths by a factor of order 100. Its intensity decays as 7?“^. In 
an accelerating Universe, this will lead to its eventual demise as the dominant 
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Fig. 12. Luminosity-temperature relations for dusty galaxies with known redshifts 
[53]. In the upper panel the galaxies are denoted by their sample origin: squares low- 
redshift IRAS galaxies with submillimetre data [107]; moderate-redshift faint radio 
galaxies in the HDF with ISO detections [133]; and very luminous IRAS galaxies 
with positions from the FIRST- VL A survey [29], and spectroscopic redshifts [252]. 
In the lower panel, sources from all samples are described according to their red- 
shifts: empty squares at 2 ; < 0.2; filled squares at 0.2 < 2 < 0.5; empty triangles at 
0.5 < 2 < 1.0; filled triangles at 1 < 2 < 2; empty circles at 2 < 2 < 3; and filled 
circles at 2 > 3. Sources represented by larger symbols have more accurate data. 
The lines in both diagrams represent other derived relations. The solid line shows 
results from low-redshift IRAS galaxy colours [79,91], while the dashed and dotted 
lines represent merging and quiescent populations of dusty galaxies respectively 
[53]. 
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radiation field, as long as galaxies continue to generate significant amounts 
of energy in the future. It is an extremely valuable tool for investigating the 
early Universe, and its properties can be modified by scattering along the 
line of sight to the last-scattering surface of its release from equilibrium with 
electrons at recombination (z ~ 1100), allowing physical conditions at lower 
redshifts to be probed. 

The key features of the CMB are its extremely accurate blackbody tem- 
perature, and its isotropy at the 10“® level. The detailed departures of the 
CMB from isotropy are very close to those predicted by the interaction of 
radiation and matter in the pre-recombination Universe. The low level of den- 
sity fluctuations, simple physics of Compton scattering and simple chemistry 
act to produce predictable peaks in the power spectrum of initial density 
perturbations on angular scales close to 1, 0.5 and 0.25 deg (/ = 213^^3, 
541^32 and 845lll25? where I is the Legendre polynomial index of the power 
spectrum [94]). These peaks represent the imprint of stationary phases in 
the gravity-driven, photon-electron pressure supported oscillations in the 
radiation-electron-baryon-CDM fluid prior to recombination. On scales cor- 
responding to oscillations that are moving at recombination, the phases of 
the signals do not add constructively and so the amplitude is lower. 

The time available for oscillations to occur is governed by the time interval 
between the epoch a particular scale becomes smaller than the horizon and 
recombination. Smaller scales came through the horizon earlier, and thus can 
go through a greater number of cycles before the pressure drops dramatically 
at recombination. The largest-scale peak corresponds to clouds that had time 
to make only a single infall to maximum density before recombination. Other 
odd-numbered peaks correspond to maximum density at recombination after 
additional oscillation cycles. Even-numbered peaks correspond to minimum 
density states: stationary rebounded clouds. There is a natural reduction 
in the intensity of CMB fluctuations on smaller scales due to the diffusion 
of photons out of overdense regions: Silk damping. Scales spaced equally 
between the peaks are associated with maximum velocities at recombination. 
Because moving electrons polarize scattered photons, the initially unpolarized 
thermal CMB should become polarized most strongly on these scales, with a 
polarization anisotropy amplitude about two orders of magnitude less than 
the (~ 10“®) amplitude anisotropies. 

The first hints of peaks in the CMB angular power spectrum was estab- 
lished by a wide range of experiments probing degree angular scales through- 
out the 1990’s. The clearest signal was provided by the ground-based Saska- 
toon experiment [199], which detected significant power on scales sandwiched 
between the well-defined anisotropy signal determined by the DMR instru- 
ment aboard the COBE satellite on scales between the dipole and 7 deg [34], 
and limits to the fluctuation amplitude matching the DMR level on smaller 
10-arcmin angular scales. The smaller-scale limits using SuZIE [132] and the 
OVRO 5-m antenna [174], and the detection of a DMR-level signal on inter- 
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mediate 10-arcmin scales using the first-generation CMB interferometer CAT 
[18]^^. The existence of detectable coherent peaks in the CMB itself testifies 
that the Universe remained neutral from z ~ 1100 until z ~ 20, otherwise 
the angular structures observed on the CMB would have been blurred out 
substantially by line-of-sight scattering. The definitive detection of the first 
peak was provided by the BOOMERanC and MAXIMA balloon experiments 
in 1998/1999 [94,173], and subsequent detailed mapping of the structure in 
10-deg^ fields using the DASI [69], CBI [238,208] and VSA [257] interferom- 
eters. 

The power spectrum provides a direct measurement of the angular size of 
an object with the physical scale of the horizon at recombination, allowing 
the angular diameter distance to be measured, and so providing an accurate 
constraint on the value of Cm -|- Ca [94]. The power spectrum has been 
measured using a variety of independent techniques, and is now one of the 
most solid observations in cosmology. The astrophysics of the interaction 
of thermal radiation with primordial ionized gas is also one of the cleanest 
processes in the Universe, and so the results put the initial conditions of 
galaxy formation on a very firm footing. In January 2003 WMAP reported a 
high-quality detection of the power spectrum all the way from the dipole to 
10-arcmin scales, including the crucial region at I ~ 50, where the rise of the 
power-spectrum from large scales to the first peak is sensitive to the speed 
at which recombination occurred. The detection of significant non-Gaussian 
anisotropies, or the failure to reproduce the existing power spectrum, would 
provide startling news. Measurements of CMB polarization in the next decade 
should provide an absolute test of the paradigm of initial conditions for galax;y 
formation building from CMB observations. 

5.1 Secondary Anisotropies 

Scattering of CMB photons from ionized structures since recombination also 
provides a useful information about galaxy evolution. The Universe was still 
very smooth when it became neutral at recombination, and there are no 
expectations that baryonic galaxy-formation processes could occur in almost 
uniform gas at over 3000 K. However, the reionization of the Universe began 
prior to z ~ 6[30,99], due to either the first stars forming or the first AGN 
beginning to accrete. Once reionization began, scattering of the CMB could 
occur. 

The Sunyaev-Zeldovich (SZ) effect [255] is the most significant mecha- 
nism for imprinting structures on the CMB. There are two components of 
the effect: a thermal and a kinematic part. The thermal effect is produced 
by Compton scattering of CMB photons off hot electrons. It imparts a char- 
acteristic increase to the net energy of CMB photons at the expense of the 
electrons, leading to a reduction in the intensity of the CMB at frequencies 

For a summary of about 20 first-generation CMB experiments see [172]. 
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Fig. 13. A comparison of the SZ spectrum in the centre of a reasonably massive 
cluster of galaxies, with an early limit to the background radiation intensity in the 
submillimetre waveband [190] and the first detection [217]. The relative sizes of the 
thermal and kinematic effect and the effects of including relativistic electrons can 
be seen. Note that at wavelengths longer than about 600 fim the SZ effect from 
clusters is likely to be larger than the typical integrated background signal from 
field galaxies. 



below 217 GHz, and to an increase at higher frequencies, that peaks at about 
370 GHz. The form of the intensity change, 

AI AT X exp X 

I T exp a; — 1 

where x = hv / U bTcmb (see Fig. 13). The term before the square brackets con- 
verts from a fractional temperature change to a fractional flux density change. 
The size of the effect is set by the parameter y = crTfcB/(weC^) f nT^ dl, which 
includes the integral of pressure through an ionized structure, via the electron 
space density n and temperature Tg] ctt is the Thompson cross-section of the 
electron. At long wavelengths, where most measurements of the signal have 
been made, AI/I = —2y, independent of frequency. 

From GOHF-FIRAS measurements of the GMB spectrum averaged over 
the whole sky y < 1.5 x 10“® [121]. In rich clusters of galaxies y > 10“^ and 
Tg ~ lOkeV, leading to a probability of scattering r = f nardl ~ 10“^. The 
detailed SZ signal from clusters has been detected by several experiments, 
including cm-wave detectors of the SZ decrement at both the OVRO/BIMA 
millimetre- wave interferometers [162] and the cm-wave Ryle Telescope in 
Gambridge [161], and at short, SZ increment wavelengths using the SuZIE and 
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SEST experiments. The well-sampled baseline structure of the OVRO/BIMA 
observations provide the finest images. With increased sensitivity, less dense 
structures such as intercluster filaments, and even individual galaxy halos 
may ultimately be detectable. There are no clear detections of the SZ effect 
from an ionized IGM or in smaller groups. This could change after reports of 
excess power on small angular scales detected by CBI announced at the AAS 
in January 2002 [238]. The excess power is greater than predicted previously 
for the SZ effect, and if confirmed, there seems to be a significant contri- 
bution to the background radiation intensity from the formation of clusters, 
with more energy being injected into the intracluster medium (ICM) than 
expected, leading to a hotter/denser gas. This type of signature should be 
detected clearly by dedicated cm- wave, arcmin-scale SZ interferometers un- 
der development. With close-packed arrays of tens of 4-5-m sized antennas 
to provide faithful images of the relatively low surface brightness of the SZ 
effect [149,167], these instruments will make accurate measurements of the 
resolved profile of the SZ effect in known rich clusters, search for the signature 
of connecting filaments between clusters, and seek high-redshift clusters. 

If the scattering electrons are relativistic, or if there is a significant prob- 
ability of multiple scatterings, then the SZ spectrum can be subtly modified. 
For relativistic scatterers, the null frequency increases slightly, and the incre- 
ment signal at high frequencies changes shape [221] (Fig. 13). In the standard 
model of structure formation by hierarchical clustering, the most massive ob- 
jects with the highest virial temperatures (Ty oc are expected at low 

redshifts, and it seems unlikely that numerous high-redshift clouds of rela- 
tivistic electrons are lurking to be detected this way. In principle, the rela- 
tivistic SZ signal could allow the measurement of the temperature of electrons 
in the ICM; however, this can probably be done more reliably using X-ray 
observations. 

The kinematic SZ effect is produced by scattering from electrons with 
a net peculiar velocity, and has the spectral dependence of the CMB. If 
the scatterers are approaching, then the intensity of the CMB is boosted, if 
receding, then the CMB intensity is reduced. The signal depends on only the 
peculiar velocity of the cluster. For reasonable cluster parameters, the peak 
of the kinematic signal is never likely to exceed about 10% of the thermal 
signal, except near the 217-GHz null frequency of the thermal effect, where 
it is the only contribution. It may be possible to make a peculiar velocity 
survey of clusters to high redshifts using this technique [39] , especially using 
the SZ null frequency channel of the all-sky Planck Surveyor CMB mapping 
mission. 

The surface brightness of the SZ effect is independent of the redshift of 
the scattering object. While cosmological surface brightness dimming reduces 
the intensity of the signal as (1-1- 2 )“'*, this is counteracted by the increase in 
the energy density of the CMB by the same factor at the redshift to the SZ 
scatterer. By comparison with the intensity of X-ray emission from the same 
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electrons that scatter the CMB, which depends on the different integral of 
n^Ti/^ along the line of sight, the absolute size of the cluster can be deter- 
mined, if T is known and uniform, and the cluster is spherical. By comparing 
with its angular size on the sky, Hq can then be determined. This is becoming 
more realistic, but more complicated than first thought, as high-resolution 
X-ray images of clusters from Chandra showing substantial substructure are 
combined with resolved centimeter/millimetre- wave SZ images. 

There are other less-significant secondary effects. The Ostriker-Vishniac 
(OV) effect, is the integrated equivalent to the SZ effect due to scattering 
from diffuse, ionized gas not associated with deep cluster potential wells. 
The Rees-Sciama (RS) effect is due to the evolution of cosmic structures as 
CMB photons cross them, leading to CMB photons being reddened as they 
climb out of a potential well that deepens during the crossing time. These 
effects have the same spectral form as the CMB, and are also independent of 
redshift, but are not expected to be greater than about 10% of the SZ signal 
[233]. 

The ionized gas that leads to the detectability of the SZ effect could have 
been generated by collisions as clusters formed, or by radiation released dur- 
ing the formation of the cluster member galaxies or their progenitors. The 
SZ effect provides a way to probe both the existence of bound virialized 
structures at high redshifts, and the way in which diffuse gas in these struc- 
tures was ionized. Within about twice the arcmin-scale gravitational lensing 
Einstein-ring radius of a foreground cluster, the lensed background SZ signal 
from more distant groups and clusters superimposed on the cluster’s own 
SZ effect should be smoother than that in a general field. Independent of 
redshift, this could provide a way to enhance fainter OV, RS or SZ effect 
signals. 



Non-Gaussian Features: Foregrounds, and especially sharp-edged linear 
and two-dimensional features like cosmic strings and textures, could intro- 
duce non-Gaussian features in the CMB. There are currently no hints of such 
structure, either via individual detections of unexpectedly hot or cold regions 
or of a non-zero higher than second-order correlation function of the CMB 
brightness. Any detection of a different type of fluctuation beyond the simple 
Gaussian field expected from quantum fluctuations during inflation could re- 
veal dramatic new physics^^. WMAP offered the first opportunity to detect 
such oddities, both by providing a large number of pixels for determining cor- 
relation functions, and covering more sky to increase the chance of detecting 
a rare, rogue feature; however, no clear signs of anything untoward emerged 
[168]. 

The two-point correlation function C2(jro|) = (T(r) T(r — ro)) fully describes 
the statistical properties of a Gaussian field: all higher-order correlation functions 
should be zero. 
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5.2 Foregrounds and Signatures of Galaxy Formation 

Both primordial and secondary features in the CMB are prone to being 
masked by and confused with foreground emission, both diffuse radio and 
infrared synchrotron, free-free and dust emission in the Milky Way, and the 
contributions of radio and submillimetre galaxies at high redshifts. In some 
regimes, it is not possible to detect CMB anisotropies, as the foreground sig- 
nals dominate. This has been investigated using synthesis models of known 
populations [258] . It has now been verified directly that the effects of galaxy 
foregrounds are not significant for primary, non-polarized anisotropy mea- 
surements by BOOMERanG, DASI, CBI, VSA and WMAP. Polarisation 
has also been detected [169,203]. 

The determination of the angular power spectrum of CMB fluctuations on 
scales from the whole sky to several tens of arcmin, at Legendre polynomial 
indices I < 4000, is a striking recent highlight in observational cosmology, 
and is reaching maturity with a wide variety of ground-based, and balloon- 
and space-borne surveys. 

Future surveys may allow foreground galaxies to be detected. For example 
the Planck Surveyor space mission will detect up to 10® dusty galaxies at 
350 to 850 yrm. Almost all of the issues concerning foreground contamination 
of the CMB signal should be settled when Planck flies in 2008. Pointing 
away from Earth, Moon and Sun at the Earth-Sun L2 point, Planck will 
map the whole sky 4 times over 2 years, imaging the CMB in 10 channels 
from 30 to 857 GHz, with a resolution of 5 arcmin at frequencies greater than 
200 GHz, and diffraction-limited resolution for its 1.2-m off-axis aperture at 
lower frequencies. While having limited polarization sensitivity, Planck will 
provide a definitive map of the whole submillimetre-wave sky on super cluster 
scales. 

Increasingly accurate CMB measurements will continue to make this cold 
component of emission from the Universe a vital one for both measuring the 
size and shape of the Universe and investigating its contents. 

6 The Evolution of High-Redshift Galaxies 

At some wavelengths, especially those of interest here, a simple census of the 
contents of the Universe is still being taken (see Fig. 14). At others, espe- 
cially optical and radio wavelengths, details of the properties of most of the 
galaxies detected (masses, metallicities and physical sizes) [235] have already 
been determined. It is easy to lose track of the fact that resolved, multicolour 
images of typical high-redshift galaxies have only been available in significant 
numbers since the observations of the HDF-N in 1995 [269]. There are sure 
to be more surprises lurking as the Universe is surveyed to greater depths, 
even at the well-studied wavelengths. We are still far from observing a rep- 
resentative sample of the first objects that re-ionized the Universe at z ~ 6, 
and their progenitors. 
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Fig. 14. Counts of galaxies detected at a range of far-IR, submillimetre and mil- 
limetre wavelengths, by a wide range of groups and techniques (a full summary can 
be found in [57]). This represents the full knowledge of the counts of dusty distant 
galaxies from published 175 — /rm ISO, 450/850 — /rm SCUBA and 1.2-mm MAMBO 
surveys to August 2002. Note that the range of flux densities probed is not very 
great, about a factor of 10. There is a need for both deeper surveys to inform us 
about the evolution of galaxies less luminous than L* - equivalent to about 0.1 mjy 
at 850 fim (see Fig. 10), and for wider surveys of the rarest, most luminous galaxies 
at brighter flux densities. It is possible that the bright counts fall very rapidly as 
a physical limit to the luminosity /mass of a galaxy is reached [142], before bright 
low-redshift galaxies or high-redshift gravitationally lensed galaxies [43] dominate 
the counts at the brightest flux densities. The overplotted lines correspond to lu- 
minosity evolution and hierarchical models of galaxy formation [54,56]. 



6.1 Optically Selected Galaxies 

The best-studied population of distant galaxies are the Lyman-break galaxies 
(LBGs) [253,254], with luminosities ranging from about 10 times greater than 
the Milky Way. These are selected using multicolour optical images from 4-m 
class telescopes, by searching for a sharply-defined, red colour break between 
adjacent filters, due both to neutral hydrogen absorption both internally and 
in intervening gas at the redshifted 91.2-nm Lyman-break frequency. Can- 
didate LBGs at 2.5 < z < 4.5 can be isolated in observations extending to 
the /-band. Spectroscopy of these photometric candidates using 10-m class 
telescopes confirms that about 70% are at these redshifts. The LBGs have 
a well-determined luminosity function [3], and detailed astrophysical infor- 
mation is generally available from optical and near-IR spectral properties 
[235,214]. At lower-redshifts, a comparable Balmer-break sample can be se- 
lected, while higher-redshift samples could be detected, at lower efficiency, 
by extending the range of filters used out to near-IR wavelengths. The LBGs 
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are a very important, well-defined baseline population of distant galaxies for 
comparison with the other galaxy populations in the distant Universe. 

6.2 Red Galaxies and Extremely Red Objects (EROs) 

By the nature of the SED break signature sought in the rest-frame UV wave- 
band, the LBGs have emergent rest-frame UV continuum flux, and are nec- 
essarily either rather blue in colour or very luminous if they are red and emit 
a trace of their total luminosity there. They display a wide variety of well- 
studied spectral features at longer wavelengths, but it is certainly very likely 
that the LBG selection process excludes galaxies with very red SEDs, or too 
faint, as insufficient UV flux escapes to yield a detectable signature. 

Adelberger & Steidel [3] have discussed the luminosity function of the 
LBGs and their relationship to other classes of high-redshift galaxies. A class 
of galaxies definitely eludes Lyman-break selection: the mixed bag of objects 
defined by extremely red colour, R—K > 5.5—6 - the Extremely Red Objects 
(EROs). This is amongst the least satisfactory of astronomical classifications. 
The condition R — K > 5.5 applies to less than 1% of galaxies in deep survey 
fields, however it is typical of both evolved stellar populations and strongly 
dust-extinguished galaxies with intrinsically blue stellar/AGN SEDs at mod- 
erate redshifts. Almost any red low-redshift galaxy would become extremely 
red and faint at 2 ~ 1, and invisible by 0 ~ 2.5. Hence, the classification fails 
either to refer to a unique intrinsic physical property or to select a homoge- 
neous population. Imposing a more severe constraint on R—K to ensure more 
extreme redness assists the detection of unusual objects, but even then careful 
follow up is required to sift out the luminous, obscured galaxies. Historically, 
the interest in EROs reflects the rare detection of interesting, anomalously 
red galaxies in narrow-field near-IR images taken in the mid-1990’s [140]. 
The advent of wide-held near-IR imaging has led to a hood of ERO samples, 
including relatively shallow surveys with K < 20 [90] and H < 21 [81], in 
which at least 80% of the detected galaxies are simply evolved ellipticals at 
moderate redshift z ~ 1, with follow-up observations [197] failing to detect 
radio and infrared emission to diagnose signihcant ongoing activity. Deeper 
surveys [245], most notably the Subaru Deep Field [259] may make a differ- 
ent selection. Unfortunately, no deep radio imaging is available in this held to 
select out the vigorous star-forming EROs. The AT ~ 20 EROs are strongly 
clustered [90], but as they are amongst the oldest populations of galaxies 
at moderate redshifts, they would be expected to have formed in the most 
overdense regions at z ~ 3 [20], and so this is not surprising. 

A link between EROs and distant submillimetre galaxies is expected: if 
the dust in these galaxies has the geometry of a screen, then their emission 
should be strongly reddened at optical wavelengths. That may be the case, 
although several of the best studied submillimetre galaxies are expressly not 
EROs [154,155], even if parts of these galaxies do have the required red colour 
[157]. Most of the submillimetre population is optically faint with / > 26 
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[243]. Verification that it is also extremely red requires imaging observations 
to K ~ 23, a feat that requires significant investment of observing time 
even with 8-10-m class telescopes [127]. At present, the association between 
distant ultraluminous infrared/submillimetre-selected galaxies and a subset 
of EROs is an open question. 

There is a direct link between the surface density of faint blue galaxies and 
the amount of energy they release. This is because the effects of redshifting a 
relatively flat blue spectrum f^, oc expected from a young stellar population 
counteracts the 1 + z factor in the equation for background energy density 
((4)) [249]. There is no such analogous conversion between the flux density 
of EROs and their cosmological significance. The hope is that the study of 
EROs develops in a similar way to how LEG studies grew from the faint, blue 
galaxy surveys in the 1980’s. The most important role they are likely to play 
is providing information about populations of evolved galaxies at moderate 
redshifts, revealing details of the process of elliptical galaxy formation at 
earlier times. As there is much speculation about the connection between 
submillimetre-selected galaxies and the formation of elliptical galaxies and 
spheroids, this may have a direct bearing on studies of the formation of dust- 
enshrouded high-redshift galaxies: EROs at 2 ~ 1 may be the fossil remains 
of all the submillimetre galaxies. 

A further motivation for seeking ERO counterparts to submillimetre galax- 
ies is the need to determine accurate positions. As a fraction of EROs are 
expected to be associated with dusty galaxies, then the detection of a rela- 
tively rare ERO within the positional error of a submillimetre galaxy is likely 
to provide a strong indication of a possible correct counterpart [127]. 

There is no published case of an EROs selected in a systematic near- 
IR survey being detected in the submillimetre, despite some searches, but 
many submillimetre galaxies have ERO counterparts [241,134,185], or are 
associated with slightly bluer, but still very red galaxies [126]. Only one has 
so far been verified as an undeniable counterpart with a secure redshifts [128]. 
In two cases, millimetre-wave interferometer observations have reduced the 
positional uncertainty of the initial submillimetre detections, co-locating the 
millimetre- wave emission and ERO, and reducing the probability of a false 
association [134,126]. 

6.3 Active Galaxy Selection 

Both optically- and radio-selected AGN have been detected at great dis- 
tances since the beginnings of observational cosmology. Only relatively re- 
cent developments in large telescope design have allowed more representa- 
tive high-redshift galaxies (the LBGs) to be detected in significant numbers. 
High-redshift AGN were amongst the first distant objects to be detected at 
submillimetre wavelengths in the mid 1990s, and dedicated searches continue 
[153]. A recent development is the first sign that QSOs at redshifts z 6 
are being viewed through an increasingly neutral IGM [119,140], so their UV 
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light is absorbed by Lyman-o; transitions in the intervening IGM - the Gunn- 
Peterson effect [144]. The re-ionization that makes the Universe transparent 
at lower redshifts is a key part of the history of galaxy formation, and the 
recent detection of the epoch at which it ceases to be effective represents 
real progress. Note, however, that the detection of re-ionization is not go- 
ing to affect progress in deep far-IR/submillimetre observations, where the 
opacity along the line of sight is negligible to even the highest redshifts. A 
more pressing concern for probing further at these wavelengths is that the 
metals necessary to allow the detection of dust and gas may not be available 
at higher redshifts, if re-ionization takes place at an epoch just prior to z = 6. 

6.4 Faint Radio Galaxies 

Up to three orders of magnitude less luminous than radio-loud AGN at ra- 
dio wavelengths are the faintest population of galaxies detectable using cur- 
rent radio telescopes, with 1.4-GHz flux densities down to about 10-20 /iJy. 
This is an interesting population with a surface density similar to the dusty 
galaxies detected using SGUBA and MAMBO (Fig. 14). If they lie on the 
tight far-IR-radio correlation observed at low redshifts [85,275], then they 
should sample the same population of dusty, luminous, high-redshift galaxies 
(Fig. 10). These studies [24,72] have targeted faint radio galaxies with the 
faintest optical counterparts, expected to indicate high redshifts, for searches 
for thermal dust emission with the SGUBA submillimetre camera. The result 
is a sample of well over 100 submillimetre galaxies with accurate sub-arcsec 
radio positions. This sample is likely to be biased to lower redshifts than 
the full submillimetre population, due to the less favourable radio K correc- 
tion, and to include an enhanced AGN fraction due to excess AGN-powered 
non-thermal radio emission. This radio population does, however, offer much 
easier follow up observations than galaxies selected only at submillimetre 
wavelengths [77]. 

6.5 Submillimetre Galaxies 

and the Optical and Radio Populations 

The selection effects at work in surveys for optical, radio and reprocessed 
thermal dust emission were discussed extensively by Adelberger & Steidel [3] 
and are illustrated in Fig. 10. They sought to exploit their large, well-studied, 
spectroscopically-confirmed sample of several 1000 optically-selected LBGs 
(at 2.5 < z < 4.5) to seek any less numerous, more luminous submillimetre- 
selected galaxies within, using the correlation between the rest-frame UV 
spectral slope of galaxies [194] which holds to luminosities L ~ 10^^ Lq 
for low-redshift galaxies, based on UV observations using the International 
UV Explorer (lUE) satellite. If the correlation holds all the way to the 
reddest LBGs, and to the luminosities of the submillimetre-selected galax- 
ies, then this is plausible. It appears to work for at least one identified 
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SMG, SMM J14011+0252 (Fig. 22) [155]; however, this galaxy is exception- 
ally bright at optical wavelengths; typical SMGs are up to 6 magnitudes 
fainter [243]. The most luminous IRAS galaxies at low-redshifts also have 
far-IR luminosities underestimated by the UV-far-IR relation [139]. This is 
easy to understand, as there are densely-enshrouded regions in the most lu- 
minous galaxies from which almost no UV light escapes, whereas more diffuse 
low-luminosity systems contain many fewer totally opaque regions. Thus the 
mean opacity (reflected in the UV-submillimetre ratio) and luminosity (from 
the sum of the emission in the optical/UV and submillimetre wavebands) 
appear to scale together. 

Submillimetre observations of optically-selected LBGs have tended to de- 
tect no emission [73], with the occasional, dramatic exception [75,155]. The 
best limits were obtained for cB58 [17], which is strongly magnified by a grav- 
itational lens. cB58 appears to be 40 times brighter than a typical LBG, and 
yet can only just be detected at (sub)millimetre wavelengths: a typical LBG 
thus probably has a very low flux 850 — /rm flux density of order 0.1 mJy (see 
Fig. 14). This is consistent with the detection of positive fluctuations in the 
noise from the positions of faint LBGs in the very deep SGUBA 850 — /xm im- 
age of the HDF-N [206], and the statistical detection of net positive 850 — /xm 
flux density from the directions of LBGs in the GUDSS SGUBA survey [265]. 
The deepest reliable 850 — /xm photometry at present reaches down to about 
2mJy for individual objects, but stacking large numbers of images to obtain 
deeper limits for populations seems reliable. 

There is certainly a Universal multi-wavelength luminosity function de- 
scribing the galaxy population at high-redshifts: one of the key goals of ob- 
servational cosmology is to determine it. However, existing submillimetre 
instruments are not sufficiently sensitive to detect the bulk of the optically- 
selected population, while current optical observations are not deep enough 
to detect submillimetre galaxies in large numbers. This is probably due to 
the submillimetre galaxies being deeply extinguished rather than to them 
being at very great distances, based on the non-detection of three example 
to deep 3 — CT limit of — 21.9 using VLT [92]. Other evidence is provided 
by the possible detection of a very red counterpart to the brightest submil- 
limetre source in the HDF-N at Kg — 23.5 [106] using the Subaru telescope, 
and the detection of red iL-band counterparts to most submillimetre galaxies 
observed at depths as deep as AT ~ 24 at 2 — ct using the Keck-NIRG camera 
[127]. 

Deeper surveys from each direction, exploiting both JWST to image sub- 
millimetre galaxies and ALMA to image the optical population should resolve 
this question directly by 2010. The deepest images from both instruments 
could extend current sensitivity limits by at least 2 orders of magnitude. 
Note, however, that it is possible that many LBGs contain little dust, and 
so ALMA may be hard pressed to detect them - many LBGs may just not 
be dusty. The same is unlikely to be true for the the detection of the optical 
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emission from the reddest and most distant submillimetre galaxies for JWST ; 
some of the SCUBA galaxies could have /-band magnitudes in the mid 30’s 
if they are intrinsically red and located at z > 3 [95] . 

6.6 Gravitational Lensing 

Gravitational lensing provides powerful evidence for the presence of dark mat- 
ter and for the accuracy of General Relativity, especially in well-constrained 
sets of images, where the image geometries are securely determined and easy 
to interpret, such as an Einstein ring of emission from a large source behind 
an almost circular lens. It is also a useful tool to detect galaxies that would 
otherwise be too faint to observe: a gravitational telescope. A description of 
the extensive formalism developed to treat gravitational lensing can be found 
in [230,27]. 

General properties of gravitational lensing are the existence of a weak all- 
pervading cosmic shear field, induced by the small deflections of light from all 
mass inhomogeneities along the line of sight to distant objects. Cosmic shear 
distorts the images of background galaxies, without magnification, modify- 
ing their axis ratios by several percent at high redshifts. The deflection is 
cumulative, but only by of order 10 milliarcsec. In the last two years this sig- 
nal been detected in large, deep, high-resolution optical images, for example 
[16,189]. It is currently well beyond the capability of submillimetre and far-IR 
instruments to detect cosmic shear, as this requires high-resolution and large 
samples of galaxies. Weak shear cannot be measured unambiguously on an 
object by object basis; however, there are hopes for the future: ALMA will 
have the sensitivity and angular resolution to measure the shapes of high- 
redshift galaxies very accurately, and by measuring molecular rotation line 
emission, could trace out their rotation curves. Cosmic shear distorts galax- 
ies in a unique way that can be distinguished from a rotation and change 
of inclination, and so deep ALMA observations could perhaps measure shear 
accurately from a small sample of galaxies [47]. 

On top of the sea of weak cosmic shear distortions, individual mass con- 
centrations produce greater distortions, and in some cases, large magnifica- 
tions and multiple images. Light is deflected by a lens by an angle a that 
depends on the mass enclosed within the impact parameter b of the deflected 
ray M(< b) divided by 6, or equivalently on the circular velocity or velocity 
dispersion of the lens at a radius b. a thus generally depends on the angle 
between lens and image on the sky 6i. The general link between image and 
undeflected source positions as compared with the lens, 0\ and Os respec- 
tively, is established by the lens equation, which for the often reasonable 
approximation of a circular lens is. 
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The angular diameter distances between the observer and source and between 
lens and source are Dg and Z?ls respectively. The magnification obtained can 
be approximated accurately by ^ = (0i/0s)(d0i/d6ls). For singular isother- 
mal spheres with a density profile p oc a is independent of the impact 
parameter. A galaxy with an isothermal halo and 300kms“^ velocity disper- 
sion defiects light by 2.2arcsec [230]. This leads to a characteristic Einstein 
radius 0 e for the formation of a ring image around a circular lens when lens 
and source are exactly aligned {Og = 0), with Oe = aDi^g/Dg. Einstein radii 
for galaxies are of the order of an arcsecond. For clusters, which have cir- 
cular velocities about 5 times greater, the corresponding angle is of order 
0.5 — 1 arcmin. 

As the angular separation between lens and source is reduced, into the 
regime where lensing magnification becomes significant, the image is first 
sheared into a magnified arclet with an increasing elongation. Somewhere 
close to a magnification /i = 2, multiple images form, with separations com- 
parable to the Einstein radius, if the projected surface density of the lens on 
the sky exceeds a distance-dependent critical value. For a distant lens this is 
comparable to all the mass in an outward-pointing section through an 17 = 1 
Universe projected onto a single plane at z = 1. Most moderate redshift 
galaxies and clusters exceed the critical surface density, whereas most lower 
density groups of galaxies do not. 

Here we will discuss gravitational lensing as a tool to learn more about 
distant galaxies at long wavelengths. The spectacular internal structure visi- 
ble in lensed images of galaxies from HST [166] cannot yet be reproduced at 
millimetre-far-IR wavelengths. The best that can be achieved is arcsec reso- 
lution using millimetre-wave interferometers, sufficient to resolve the multiple 
images of strongly lensed QSOs [4,276]. 

The exploitation of gravitational lenses for the study of the evolution of 
dusty, gas-rich galaxies has so far been confined to a small sub-region of 
all lensing studies, in which distant strongly-magnified galaxies have been 
detected when their unmagnified counterparts would not have been at the 
sensitivity limits of existing surveys. This includes the brilliant IRAS galaxies 
F10214-H4726 [223,234] and APM 08279-k5255 [152], at redshifts z = 2.3 and 
3.8 respectively, much greater than the bulk of the IRAS population, the 
LEG cB58 [115], and a triply-imaged background galaxy in the field of the 
cluster A 2218 [170], all parts of which are readily detected in a deep SCUBA 
image. 

As angular resolution and multi-waveband coverage is at a premium at 
long wavelengths, the ability to examine lensed sources with large magnifi- 
cations is very useful. Many of the galaxies with SEDs plotted in Fig. 1 have 
multiple datapoints only because of gravitational lensing. An interesting fea- 
ture is that some of these galaxies appear to have unusually fiat mid-IR SEDs. 
This may be due in part to selection effects, such as bright optical emission 
being essential for their detection, identification or recognition [152]; how- 
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ever, it could also reflect the spatial structure of the dust emission within the 
objects. Hotter dust emission might be expected to be more compact, and 
thus potentially lead to a greater magnification [113,44]. 

The effects of source confusion noise can also be reduced by lensing, which 
both separates galaxies on the sky, making them easier to resolve apart, and 
makes them brighter, and thus easier to detect. ISO and SCUBA images in 
the lensed fields of rich clusters of galaxies [7,42,240,88] are effectively deeper 
than those in the held, taking advantage of this reduction of confusion noise. 
This is especially useful at these wavelengths, where potentially confusing 
emission from passively evolving cluster galaxies is relatively weak, and the 
available angular resolution is relatively coarse. The Einstein radii of clusters 
subtend angular scales of order 1 arcmin at z ~ 0.2 — 0.4, well matched to the 
fields of view of the SCUBA, MAMBO and SHARC-II (sub)millimetre-wave 
cameras. 

The effects of lensing on the detectable population of galaxies depends on 
the from of the galaxy counts. Lensing conserves surface brightness: it neither 
creates nor destroys photons, and does not change their frequency. Hence, 
the total intensity of background radiation from the sky is not modified by 
lensing. This condition leads to a modification of the counts. A magnification 
/i changes a differential count 
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Hence, if the slope of a differential count a is steeper than —2, corresponding 
to an index steeper than —1 for the integral counts shown in Fig. 14, then 
B > 1, and the magnification bias is positive for positive magnifications 
/i > 1, and the number of detectable sources is increased. At submillimetre 
wavelengths the counts can be very steep, and have slopes that change rapidly 
with increasing depth (Fig. 15), and so the magnification bias can be very 
significant, especially at the brightest flux densities. 

Magnification bias at these wavelengths is likely to remain an impor- 
tant effect in the future, helping to overcome limitations of angular reso- 
lution to study deep fields and the internal structure of sources. Relatively 
shallow wide-held surveys in these wavebands, that probe the steep bright 
counts should reveal many very luminous lenses. The first examples should 
come from large-format ground-based (sub) millimetre- wave cameras includ- 
ing BOLOCAM, 100-deg^ surveys from Herschel and the Planck Surveyor 
all-sky survey [43]. 
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Flux Density / mJy 

Fig. 15. A comparison of the counts of galaxies at optical, near-IR, far-IR and 
submillimetre wavelengths. The thick and thin lines correspond to the different 
models shown by the heavy lines in Fig. 14. The slopes of the lines are overplotted 
to show where magnification bias is likely to be large. The submillimetre counts 
almost certainly change slope through the significant a = —2 value for which the 
magnification bias changes sign at the depths that can be probed by ALMA in 
a reasonable time, reflecting that mJy sources generate a significant fraction of 
the background radiation intensity and the faintest counts flatten. The difference 
between the slopes of the deep optical and submillimetre counts further emphasizes 
that they provide different information about galaxy evolution. 

Perhaps very close to the centres of clusters of galaxies ALMA can exploit 
the lensing effect to increase the effective field of view of the telescope [46] . If 
the counts are flat, with a > —2 fainter than 100 /rJy, and the cluster has a 
sharply-peaked central density profile, then the swarm of demagnified images 
expected to lie close to the core of a cluster could be detected using ALMA, 
which has the sensitivity to detect these faint images, and the resolution to 
separate them on the sky. 

6.7 The Process of Constraining Galaxy Evolution 
from Submillimetre Observations 

The key non-intuitive flat flux-density-redshift relation (Fig. 10) should al- 
ways be remembered when discussing galaxy evolution from submillimetre- 
wave data. Galaxies at moderate and high redshifts detected in the submil- 
limetre are selected by luminosity not distance, subject to their SED (Fig. 11). 
The surface density of SCUBA galaxies brighter than about 2mJy is about 
0.5 arcmin“^. Each of these, with a 40-K dust SED, has a luminosity of about 
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5 X 10^^ Lq - a very luminous galaxy. If this assumption is wrong, and the 
galaxies had SEDs like the Milky Way, this would still correspond to a lu- 
minosity of about 5 X 10^^ Lq, an order of magnitude greater than the dust 
luminosity of the Milky Way. If the temperature was in fact greater than 
40 K, then the intrinsic luminosity would be increased. The volume in which 
these submillimetre-selected galaxies lie is a very long, thin pencil beam, with 
a great extent in the redshift direction, extending from z ~ 1 to z ~ 5. The 
volume beyond z ~ 5 is relatively small, and so the comoving volume occupied 
by these galaxies is thus of order 10“^ Mpc“^ (Fig. 7, whereas IRAS galaxies 
with this luminosity have a space density < 10“®Mpc“^ at the present day 
(Fig. 4). Hence, regardless of the lifetime or duty cycle of the submillimetre 
galaxies, they evolve very dramatically as compared with their ultraluminous 
counterparts at low redshifts, being at least about 200 times more numerous 
in terms of space density at z > 1. 

Without redshifts for a large number of submillimetre galaxies, it is dif- 
ficult to be certain of the role they play in the process of galaxy formation 
[192,240,178]. Much has been discussed about their potential identification 
with the epoch of formation of the spheroidal components of galaxies. Since 
most of the stars in the Universe are in elliptical galaxies [130], these ar- 
guments implicitly accept the cosmological significance of the submillimetre 
population. However, it is possible to make several simple arguments which 
leave this question open, and have a significant impact on our understanding 
of galaxy evolution. 



Elliptical Galaxies: The space density of high-redshift SCUBA galaxies is 
at least 3 orders of magnitude less than that of L* galaxies at the present 
epoch, and similar to the space density of giant elliptical galaxies [105,109]. 
This could indicate that every submillimetre galaxy is destined to be a giant 
elliptical galaxy at the present epoch, and that only giant ellipticals formed 
in this way. However, less dramatically, a much greater fraction of galaxies 
could go through a submillimetre luminous phase, perhaps during the most 
significant merger event in their history. If this phase lasts only about 0.1% of 
the Hubble time, then this would be compatible without generating too many 
submillimetre sources. It is thus possible that all galaxies could be visible in 
a brief submillimetre-luminous phase, and then quietly slip back onto the 
ordinary luminosity function. These numerous brief bursts of activity also do 
not exceed the measured background radiation intensity. If brief enough, then 
the number of galaxies undergoing this phase would by chance be consistent 
with the observed space density of giant ellipticals. The necessary duration of 
every galaxy’s submillimetre phase would have to be 10^ years, which is not an 
unreasonable duration for an intense, centrally condensed burst of activity 
in a massive galaxy, corresponding approximately to the Galactic rotation 
period. It is reasonable for at least a significant minority of all galaxies to 
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undergo a submillimetre luminous phase at some stage in their evolution, or 
for the galaxies to be extremely rare. 

The correspondence of the apparent space density of submillimetre galax- 
ies and low-redshift giant elliptical galaxies is interesting, but it could simply 
be coincidence. The stars in giant ellipticals are very old, but could have 
formed in a series of mergers through a relatively short interval at high red- 
shifts, each of which may not have been luminous enough to be detected 
as a submillimetre galaxy. These galaxy sub-units would subsequently con- 
gregate together at lower redshifts to make the elliptical galaxy [248]. The 
star-formation process would have to deplete enough gas to prevent substan- 
tial low-redshift star-formation activity, while not disturbing the colours of 
the galaxies. ERO samples with AT ~ 20 present plausible evidence that many 
elliptical galaxies are old and in place at 2 ; ~ 1, but cannot reveal whether 
their formation involved a submillimetre bright phase. 



Elliptical Galaxies and AGN: Perhaps the best argument in favour of a 
rapid formation timescale for ellipticals comes from the bulge-blackhole mass 
correlation, which is much easier to imagine coming from a common single 
event than a drawn-out series of events in the history of all galaxies. However, 
in that case where are all the typical L* galaxies in submillimetre surveys? 
The blackholes in spiral galaxy bulges and in smaller ellipticals seem to trace 
the stellar mass in the same way as in the giant ellipticals and yet they are not 
represented in existing submillimetre surveys. Is there a conspiracy to gen- 
erate about 50% of the background radiation in galaxies with a very sparse 
spatial distribution that are much brighter than L* at high redshifts, and yet 
for almost all galaxies to have a scaled-down bulge component with a black- 
hole whose properties are linked in the same way? This is quite acceptable: 
only a small fraction (< 1%) of galaxies are in the luminous phase at any 
time, and yet most could have gone through such a phase, generating most of 
their total emitted energy during it. If they turn out to be strongly clustered, 
with a correlation length greater than Mpc, then it perhaps indicates 

that they are associated with the densest, earliest collapsing, most active 
regions in the early Universe. However, if their luminosity is short-lived or 
merger-driven, as in all low-redshift galaxies of comparable luminosity [226], 
then the associated brief flashes of activity are likely to be less correlated. The 
submillimetre galaxies may only be located in the most overdense regions, but 
with a short duty cycle, with only one ever being observed per cluster. The 
test will be a survey sufficiently large to detect a representative number of 
the largest overdensities of submillimetre galaxies on the sky, several square 
degrees in extent. So far the total area surveyed to significant depths by sub- 
millimetre telescopes is less than 0.3 deg^, and no striking overdensities have 
been found. Useful upper limits have been imposed, but no convincing 2-point 
correlation function has been detected [232,265,68]. Some possible examples 
have been found ~ the surface density of submillimetre galaxies around some 
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high-redshift radio galaxies [156] and clusters [41], and associated with one 
dense field of LBGs [74] is about an order of magnitude greater than in the 
field. The nature of the ultimate remnants of the submillimetre galaxies are 
intractable at present, despite some interesting insights [127]. At present, it is 
much easier to address the evolution of the submillimetre galaxy population 
as a whole. 

6.8 The Evolution of the Submillimetre Galaxy Population 

As with other populations, it is necessary to determine possible forms of dusty 
galaxy evolution that can explain the observed counts and background ra- 
diation. As discussed above, the submillimetre galaxies certainly evolve very 
strongly, when compared with galaxies of similar luminosity at the present 
epoch. However, from their contribution to the background radiation intensity 
it is clear that this evolution has to be much closer to pure luminosity evo- 
lution than pure density evolution ~ to boost the space density of detectable 
galaxies by the factor of 200 required to explain the counts, evolution in num- 
ber density by this factor would lead to the measured background radiation 
intensity being greatly exceeded. 

6.9 Simple Forms of Evolution 

The most direct way to fit the results of observations is to take a luminos- 
ity function and choose evolution functions to match. It is important to be 
sure that the model is not too complex, that is the number of parameters 
describing it can be adequately constrained by the available data. 

When looking at the evolution of dusty galaxies, the natural luminosity 
function is that determined from observations of low-redshift IRAS galaxies 
at 60 /im [228,247]. In order to explain the shape of the IRAS galaxy counts 
(Fig. 5) strong evolution is necessary out to the 2 ; ~ 0.1 limit of the IRAS 
survey. Without this evolution, the predicted faint counts would be too few. 
The form of evolution required is similar to the evolution of optically-selected 
galaxies [177] and QSOs [62] at z < 1. The strength of evolution required 
is only weakly dependent on the assumed SEDs of IRAS galaxies, as they 
sample a small redshift range over which the K correction is low, regardless 
of the SED. 

The next piece of information is provided by deep ISO surveys at 175 /xm, 
which probe similar rest-frame wavelengths, reaching out to z ~ 1. At 15 /xm 
the SEDs of IRAS galaxies include spectral lines and small-grain PAH emis- 
sion/absorption features. The counts at 15 /xm probe out to similar redshifts, 
and also show strong evolution. As 15 and 60 /xm are on the same side of 
the peak of the SED, the relative counts provide little information about the 
temperature of the dust. However, by combining an evolution function with a 
simple single-temperature SED, the IRAS and 175 — ^m ISO results can be 
seen to be compatible with a 40-K temperature and luminosity evolution of 
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the form + (Fig. 6). This temperature is significantly higher than that 
in quiescent galaxies [219], but agrees with values measured from submillime- 
tre observations of low-redshift IRAS galaxies [107], from some high-redshift 
dusty galaxies, for which SED measurements exist on both sides of the rest- 
frame peak [154,155] (see Fig. 1), and from other multiwavelength investiga- 
tions of evolution [261]. It is also possible to explain the form of low-redshift 
evolution by pure density evolution; however, the redshift dependence must 
then be steeper, as (1 -I- z)-'’ , and must not continue to the redshifts of the 
ISO galaxies, or else the radiation intensity would exceed that observed by 
at least an order of magnitude. Whatever form of evolution applies, it must 
appear similar to pure luminosity evolution. 

The existence of more distant galaxies was not probed by the ISO surveys, 
which reached the confusion limit of a 0.6-m antenna with a detector that 
undersampled the telescope point-spread function, and covered only small 
areas of the sky. To extend the available information to higher redshifts, 
observations at longer wavelengths are required. Both the spectrum of back- 
ground radiation (Fig. 16) and the counts of submillimetre galaxies (Fig. 14) 
provide this information. The K correction allows submillimetre galaxies to 
contribute to the counts out to the highest redshifts; however, while the back- 
ground spectrum incorporates the emission from all redshifts, but the effect of 
redshifting the spectrum and diluting it by a factor (l-l-z) ((4)), ensures that 
it is much more sensitive to activity at z ~ 1 than at z ~ 5. The constraints 
on a form of high-redshift evolution imposed by these data are illustrated 
in Fig. 17. The ‘anvil’ model used includes luminosity evolution of the form 
g{z) = (H-z)-^ ® out to redshift Zmax, followed by a constant luminosity func- 
tion modified by the function g{z) = (1 -I- Zmax)^'® for Zmax < z: < Zq, with 
no galaxies beyond zq . The requirement to match both the 850 — gm galaxy 
counts and the background spectrum at wavelengths longer than 200 gm con- 
strain a track through the (zmax) zq) parameter space in similar directions. 
There is no unique best-fit value. This is a generic feature of all evolution 
functions, including models without either a sharp change of slope or a sharp 
cutoff. 

This degeneracy remains even for very accurate submillimetre counts. The 
width of the permitted swath in parameter space is reduced if the error bars 
are tightened, but no unique clear maximum emerges. Points along the best- 
fitting track correspond to models in which strong evolution takes place out to 
an increasing value of Zmax, but which is traded off against a reduced upper 
limit for the population zq. At all points on the track, the total amount 
of energy generated, the evidence of which now resides in low-mass stars, 
stellar remnants, metals or supermassive blackholes, agrees with observations, 
subject to a stellar initial mass function (IMF) that has a low-mass cutoff at 
~ 1 Mq, and a metallicity in the Universe that is raised to close to solar values 
globally, in the absence of a contribution from AGN. Based on the bulge- 
blackhole mass ratio, star-formation activity is likely to dominate AGN power 
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Fig. 16. Various measurements of and limits to the background radiation intensity 
from galaxies, and the spectrum of the CMB, from the radio to UV wavebands, 
plotted in units of equal energy per logarithmic frequency interval. The nnmbers 
refer to different observations: a detailed key can be found in [57]. Over most of the 
range where dust emission contributes strongly to the background spectrum, from 
3mm-10/im, the intensity of the background has been measured to an accuracy 
within a factor better than about 3, generally in the last 5 years. COBE, ISO and 
SCUBA observations in the 1990’s were essential for providing these measurements. 
Note that the energy density in the CMB is about 100 times greater than the total 
included in all the measurements shown. The background intensity in the X-ray 
waveband is the only significant component that is not shown, making up a few 
percent of the total non-CMB background intensity. 



by a factor of about 5 [51]. Further details can be found in [56], while similar 
models have been discussed elsewhere [224,80]. 

Any reasonable current form for the evolution of dusty galaxies is shown 
in Fig. 24, in which the evolution function 

g{z) = (1 + sech ^ [6 ln(l + z) — c] cosh^ c , (32) 

with 6 ~ 1.9 and c ~ 1.8. This functional form accounts for all the most 
current data shown in Figs. 14 & 16 and what is known about the redshift 
distribution of the submillimetre sources [243] . By using this more adaptable 
evolution function, a less degenerate result can be obtained: see Fig. 18, in 
which the values of parameters b and c that can account for the data are 
constrained to lie within a more compact region of parameter space. 
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Fig. 17. The relative importance of different high-redshift information for con- 
straining a model of galaxy evolution [56]. The anvil model is used for illustration. 
The data is for the far-IR/submillimetre-wave cosmic background radiation mea- 
sured from COBF-DIRBE and FIRAS [122], and from the first measurement of the 
counts of submillimetre-selected galaxies [240]. The parameters are the redshifts 
at which (1 -|- luminosity evolution ceases, «max, and at which the popula- 
tion ceases to be present beyond zq. Between Zmax and zq the evolution function 
is (1 -I- 2o)^'^. The constraints from the submillimetre-wave counts are tighter than 
from the background determination; however, they provide a constrained swath of 
parameter space rather than a peak probability. This is a generic feature of the 
models, and the reason that breaking the counts into a redshift distribution is a 
crucial step for making progress. 



Note that none of this discussion includes any details of the physical 
processes that generate the evolution or luminosity. This is necessary for a 
full understanding of the situation, but it is important to remember that any 
such model must predict a luminosity function that evolves in a similar way 
to that derived here. 

6.10 Caveats 

The process described here assumes a constant SED for the submillimetre 
galaxies, and that their luminosity function at high redshifts can be described 
by the form observed for IRAS at low redshifts, subject to luminosity evo- 
lution. These assumptions appear to be reasonable based on existing data, 
but need further testing. In particular, as discussed above in the context of 
observational selection effects, systematic changes of either the typical dust 
temperature or the width of the range of temperatures with increasing red- 
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Fig. 18. The restrictions on parameters b, c (Equ. (32)) required to account for 
different types of data. In the left panel the ±<t errors on counts are shown by 
the dashed and solid lines: thick solid - submillimetre at 850 /rm; thin solid far- 
infrared at 175 ^m and dashed radio at 4.8 GHz. The dotted lines show the 1-, 2- 
and 3 — (7 quality of fit to the background spectrum (Fig. 16) in order of decreasing 
line thickness. The right panel shows the combined probability contours spaced in 
units of a away from the best-fit value shown by the encircled star. The minimum 
value of = 4.1. 



shift could easily lead to errors of interpretation. These issues have been 
discussed and simulated by Bales et al. [109]. If high-redshift galaxies had 
systematically cooler dust temperatures, then less dramatic evolution would 
be required to account for their properties. This would be a real concern if the 
value of T ~ 40K that accounts for the low-redshift data were greater. How- 
ever, at present it seems reasonable that this temperature remains reasonable 
for the cores of luminous dust-enshrouded galaxies out to high redshifts. No 
very luminous galaxies containing only cold dust are known - the coolest ISO 
selected galaxies at moderate redshifts have temperatures T ~ 30K [76]. The 
mass of dust in a cold, very luminous galaxy would also have to be much 
greater than in a hotter galaxy with the same luminosity. At some point the 
available metallicity imposes a lower limit to the dust temperature for a given 
luminosity [56]. 

6.11 Redshift Distributions 

The way to break the degeneracies between different forms of evolution is 
to determine the redshift distribution of a sample of submillimetre galaxies 
[56]. By comparison, improving error bars on the counts and backgrounds, 
or finding samples at other submillimetre wavelengths does not narrow the 
track through the parameter space shown in Fig. 17 dramatically. The redshift 
distributions predicted by some plausible models for the evolution of galaxies 
based on counts and background as described above are compared in Fig. 19. 
This includes all the models described in [56]. Note that the anvil models 
tend to provide better fits to current data. A revised version of the plot, 
including the best fitting ‘Anvil-5’ model, and the new best-fitting sech^ 
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Fig. 19. The redshift distribution predicted for galaxies selected using SCUBA to 
a depth of 2mJy at 850 pm in a variety of models that provide reasonable account 
of the far-IR/submillimetre-wave counts and background. The model predicts these 
quantities to within about a factor of 2 of each other [56], with the exception of 
the two thinnest dashed lines, which predict counts that are higher than the cloud 
of current results shown in Fig. 14. The dotted line corresponds to a model with a 
burst of activity with a Gaussian profile 0.8-Gyr wide in time centred at z = 2.6: 
see the thin dotted line in Fig. 24. The solid lines correspond to ‘anvil models’, with 
a sharp change of slope in the evolution function, and the dashed lines to smooth 
‘peak models’. The significant spread in redshift distributions emphasizes the im- 
portance of measuring the distribution to discriminate between the models. The 
observational determination of a complete redshift distribution for submillimetre 
galaxies is a vital goal. No data is plotted, as it is not yet possible to provide a reli- 
able estimate of the redshift distribution. Radio-selected surveys detect up to 70% 
of the submillimetre galaxies detected in direct submillimetre surveys [24,72] should 
soon provide reasonable redshift statistics. Based on the flux density-redshift rela- 
tions shown in Fig. 10, most submillimetre galaxies are unlikely to be at z 3. Deep 
radio observations of submillimetre galaxies [242,77] indicate a median redshift in 
the range 2 < z < 3. There is no evidence for a significant tail of submillimetre- 
selected galaxies at z < I [243] where the K correction is less favourable. The 
models represented by the two thinner solid lines and the dotted line provide the 
best description of the redshift distribution known at present. 



model ((32)) are compared at selected interesting depths and wavelengths in 
Fig. 20. This shows both the range of possible redshift distributions that can 
be accommodated at 850 /rm, especially concerning the number of galaxies at 
low (as defined in the submillimetre - z < 1) and very high (z > 5) redshifts. 
It is difficult to determine the redshift distribution [243]; however, to be sure 
of the form of high-redshift galaxy evolution, it must be found. 
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Redshift 

Fig. 20. The predicted redshift distributions of deep mid-IR, far-IR and submil- 
limetre galaxies at sensitivities that correspond to the maximum achieved using ISO 
at 15 and 175 /rm, SCUBA at 850 /im, and practical depths using the forthcoming 
350 — /rm SHARC-II camera at CSO. One of the models from Fig. 19 - ‘Anvil-5’, 
with strong evolution to z = 1.8 and then a constant luminosity function to a cut- 
off redshift 2 = 5- still accounts well for the observations {dashed lines), and is 
compared with a more recent, smoothly varying model [57,243] {solid lines). The 
differences between the predictions are most striking at high redshifts and at long 
wavelengths - the numbers of 850 — /rm detected galaxies at redshifts approaching 
5 are expected to be much greater in the 1999 Anvil-5 model. The determination of 
the redshift distribution of submillimetre galaxies will readily distinguish between 
these models, despite their very similar counts and backgrounds. 



Radio detection of counterparts to submillimetre galaxies has helped con- 
strain redshifts [67,242,274], including samples selected first as optically-faint 
radio galaxies, and only then observed at submillimetre wavelengths [24,72], 
to allow more rapid sample compilations than in blank-field submillimetre 
surveys. Deep near-IR follow-up observations appear to identify a significant 
fraction of submillimetre galaxies at AT ~ 22 — 23 The problem is that cur- 
rent limits for detecting the continuum from these galaxies in spectroscopic 
observations generally require AT ~ 20 or less [92,106,127]. If high-redshift 
galaxy targets have strong emission lines, then fainter galaxies with AT ~ 21.5 
could be observed, but the faintest of these objects will probably only yield 
redshifts using a future 30-m ground-based telescopes or JWST. Searches for 

Relevant observations were carried out (during European daylight while we skied) 
using the NIRC camera at the Keck telescope in Hawaii [127]. Thanks to collab- 
orators Lee Armus, Dave Prayer, Naveen Reddy and Nick Scoville. 
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CO emission to determine and confirm redshifts absolutely are also possible 
[73]. 

In a remarkable successful run of observations also coincident with the 
Saas Fee school, a radio pre-selected submillimetre galaxy sample [72] was 
observed using the new blue-sensitive arm of the LRIS spectrograph at Keck, 
and a crop of over 20 redshifts for radio-submillimetre-selected galaxies were 
found [77]. The associated spectra provide evidence for AGN features in a 
minority of the sources, and evidence for outflowing absorbing gas [244] in 
a majority [78]. Optical multi-object spectroscopy targeting accurate radio 
positions appear to be the most efficient way of determining redshifts for 
submillimetre galaxies. 

6.12 Semi-analytical Modelling 

As there is no certain redshift distribution for submillimetre-selected galax- 
ies, despite the progress on faint radio pre-selected samples, only so much 
can be determined about their evolution. There are a variety of models that 
describe their properties. Some of the most interesting attempts integrate 
semi-analytical galaxy formation models, that combine the results of N-body 
simulations of the gravitational growth of structure with generally simple 
recipes for the physics that describes the cooling of gas in the galaxies, and 
the subsequent star-formation and feedback processes. A look into the multi- 
phase, very optically thick, dynamically-shredded ISM of even the relatively 
low-luminosity Antennae galaxies [271] should perhaps caution against treat- 
ing these processes too simply. This empirical caution is supported by the 
results of gas dynamical simulations to trace the evolution of gas in merg- 
ing galaxies, in which a rich array of complex, short-lived, rapidly variable 
behaviour can be seen [195,31]: the results depend strongly on the initial 
conditions of the galaxy mergers - the impact parameter, energies, relative 
inclination etc. 

The first attempt to incorporate details of the astrophysics of forming 
galaxies to explain the submillimetre-wave counts and background was made 
by Guiderdoni et al. [143], as the first results of surveys for high-redshift 
dusty galaxies arrived in 1997. The standard recipes invoked did not predict 
enough very luminous systems. These were legitimately added as a separate 
population of galaxies (in their E model) to account for the data. Semi- 
analytical models [267,163,248,35] had traditionally accounted for the growth 
of stellar populations out of cooling gas that settles in virialized halos to form 
rotationally supported disks. If this additional population is due either to the 
collapse of low-angular-momentum haloes or the violent mergers of galaxies, 
then their properties lie outside this realm of gradual, regular activity. In 
particular, most recipes involve a close link between mass and luminosity, 
with the most luminous objects, especially giant elliptical galaxies forming 
late in the Universe’s history when halo masses grow large enough. This does 
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not necessarily mean that the stars within formed late, just that the galaxies 
assembled late. 

More recently, another semi-analytic model, which includes a more effi- 
cient mode of cooling of gas in the deepest potential wells, thus generating 
more rapid evolution and more luminous, but shorter-lived galaxies, has been 
proposed by Granato et al. [142]. This approach explicitly includes the for- 
mation and fueling of AGN, which are not typically involved in semi-analytic 
models. The AGN luminosity increases exponentially through the luminous 
phase of its duty cycle, but the star-formation activity is more smoothly dis- 
tributed in time. Hence, later in the duty cycle a massive galaxy would be 
more likely to be recognized as an optical QSO, but is more likely to be de- 
tected as a star-formation powered object at much earlier times. Feedback 
from winds generated by supernovae and by the AGN strip fuel to termi- 
nate star formation activity and AGN accretion. This feedback is expected 
to be less effective in suppressing activity in deeper potential wells, leading 
to a ‘reversed hierarchical’ picture in which the most massive galaxies form 
in single events, relatively early, in regions destined to be rich groups and 
clusters of galaxies at late times. By construction, this model accounts for 
the Magorrian relation between the mass of supermassive blackholes and the 
velocity dispersion of the surrounding stars [188]. A similar explanation of 
very powerful radio galaxies [12], in which dust-enshrouded star-formation 
can only take place in young systems that the AGN has not grown large 
enough to disrupt. 

Other scenarios are also possible in which winds driven by high-mass stars 
rather than by AGN drive out gas from a galaxy to prevent further accretion 
and/or star formation [239], or consume all the available gas into stars, again 
preventing further activity. Perhaps dark-matter could also accrete onto a 
seed blackhole at a low rate in a manner determined by the halo potential, 
gradually and inconspicuously generating the observed bulge-blackhole cor- 
relation over a significant fraction of a Hubble time. For the most recent 
discussion of the links between star formation activity and AGN see [264]. 

Without a direct, detailed time-series of observations of a large number of 
elliptical galaxies in the process of formation, or a detailed time sequence of 
galaxy mergers, providing a time-lapse image of the velocity and constitution 
of the gaseous, stellar and dark matter components of these objects, then 
the issues of the power source of the luminous dusty high-redshift galaxies 
are unlikely to be settled soon. At present, the early formation of elliptical- 
like stellar systems in Granato et al.’s picture [142] seems to be more in 
keeping with the detection of quiescent EROs at z ~ 1, than more gradual 
formation pictures that tend to have problems generating both the required 
numbers of high-luminosity, high-redshift dusty galaxies, and the associated 
evolved remnant EROs at z ~ 1. Whether it can account for the evolution of 
dusty galaxies awaits measurements of spectra and masses for submillimetre 
galaxies. 
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The results of different models all account for the published data, this 
usually being a necessary requirement for referees to accept papers and for 
editors to publish them! However, differences between the redshift distribu- 
tions predicted by various models are very valuable for learning about galaxy 
formation once more data becomes available. 

6.13 Background Radiation Constraints 

The background radiation intensity at far-IR wavelengths is now determined 
reasonably accurately (Fig. 16). There should be relatively little freedom re- 
maining to modify the form of evolution described dramatically and still 
reproduce the results. One important area for future development is the use 
of 77 — >■ e^e“ opacity between CMB/infrared background photons and TeV 
energy cosmic-ray photons to probe the radiation field. The interaction cross 
section for photons peaks sharply just above the electron-positron pair cre- 
ation energy of 1.02 Me V, and corresponds to the interaction of a low-energy 
far-IR background photon with a wavelength in /rm that is equal to the nu- 
merical value of the energy of a high-energy photon in TeV. The optical depth 
is such that observations of downward curvature in the measured 7 -ray SEDs 
of low-redshift blazars are interpreted as being due to 7 -ray attenuation by 
this effect. So far, the limits imposed by this method [251] are subject to un- 
certainties based on lack of knowledge of the intrinsic 7 -ray SEDs of blazars, 
and to small numbers of both detectable blazars (2 so far) and numbers of 
detected photons from each [146]. However, more capable ultrahigh energy 
7 -ray observatories are coming into service, and the results should allow an 
accurate spectrum of the mid- and far-IR background radiation intensity to 
be determined from higher-quality observations of a larger sample of blazars, 
less susceptible to concerns about rapid variability modifying the spectrum. 

6.14 Constraints from Bright Counts 

The counts of distant dusty galaxies predicted by a variety of models gener- 
ally agree with the faint few-mJy counts of submillimetre galaxies determined 
by SCUBA and MAMBO surveys [243,57] and the 100-mJy counts of far-IR 
galaxies detected by ISO at 175 /xm. However, one of the most interesting 
questions is what happens at the brighter flux densities. (Fainter probes are 
currently not possible due to the limitations of resolution, and hence source 
confusion.) As a result of the strong K correction at submillimetre wave- 
lengths, the deep SCUBA counts mostly consist of high-redshift galaxies, 
with a relatively small contribution, perhaps 10 %, from galaxies at redshifts 
z < 1. However, there are luminous low-redshift dusty galaxies with 60 — /xm 
flux densities up to about 100 Jy, corresponding to about 1 Jy at 850 yxm. By 
moving to brighter flux densities than are probed by existing surveys, both 
more luminous distant galaxies and more proximate galaxies will begin to 
influence the counts. As high-redshift galaxies have submillimetre fluxes that 
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are almost independent of distance, the slope of the submillimetre counts at 
flux densities greater than about 10 mJy should correspond quite accurately 
to the high-redshift submillimetre luminosity function. To ensure convergence 
to the measured background intensity, this must decline more rapidly than 
oc where a < —2. Low redshift IRAS galaxies have a power-law lu- 
minosity function with a ~ 2 at the highest luminosities, and this could 
conceivably continue out to the high redshifts. This slope of the luminos- 
ity function reflects processes taking place in the most luminous, short-lived, 
phases of galaxy evolution, and so a direct determination of its form is sure to 
be important. However, mass functions of bound objects are predicted to have 
exponential forms at the highest masses. As a result, a cutoff mass/luminosity 
might be expected in the distribution of high-redshift galaxies. How does this 
interact with the observed more gradual decline of the luminosity function of 
the most luminous low-redshift galaxies? The decline of the high-redshift 10- 
100-mJy 850 — fxm counts and the flux density above which the low-redshift 
Euclidean population of less exotic galaxies should start to dominate the 
counts is very uncertain. In some models [224] a large density of cool low- 
redshift galaxies are included, in addition to the extrapolations of the IRAS 
60 — ^m luminosity function to longer wavelengths using the SEDs deter- 
mined for samples of 60 — /xm-selected IRAS galaxies [107,108]. In others, 
a precipitous drop is predicted [142] over a narrow range of flux densities. 
These models should be easy to discriminate using the bright counts at flux 
densities of several 100 mJy that should be detected at three submillimetre 
wavelengths by the Planck Surveyor space mission [43]. 

To recap, the most sensitive indicator of the concordance between a model 
and data is the predicted redshift distribution of the sources: as we have seen, 
shifting activity to higher redshifts reduces the intensity of the background 
radiation associated with the generation of a certain amount of energy, by 
a factor of 1 -I- 2, while having little effect on the counts. The redshift dis- 
tribution of submillimetre galaxies can be affected in a radical way, however 
(see Fig. 19). Published predictions of redshift distributions are not available 
for all the models listed, as observational results concerning the distribution 
for submillimetre galaxies selected without the aid of radio information are 
sparse. A picture of a radio-selected sample with a median redshift 2 ~ 2.5 is 
starting to emerge [242,77]. Be aware that the predicted distributions can be 
biased to unrealistically high redshifts, especially in models where the evolu- 
tion of luminosity density in the Universe remain flat over long intervals of 
redshift beyond z ~ 1 (Fig. 24). 

7 Properties of High-Redshift Dusty Galaxies 

The submillimetre-selected galaxies are a key component of the distant Uni- 
verse, and are likely to have contributed a significant fraction of the energy 
released by galaxies over the history of the Universe. They have only been 
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known since 1997, when the SCUBA camera first detected them, and at 
present it is challenging to study large or complete samples at other wave- 
lengths [243]; many are extremely faint with optical magnitudes / > 26, and 
it is difficult to identify more than 50% in even the deepest images. Part of 
this is perhaps due to their uncertain positions to about 5-10 arcsec, with can- 
didates offset in position, but areas extending 15 arcsec have been searched 
for counterparts. A clear example is the 8-mJy SCUBA galaxy in the Hubble 
Deep Field [101,150], whose identification remains enigmatic, and could be 
associated with an extremely faint red galaxy [106]. 

The submillimetre galaxies can genuinely be called part of the cool Uni- 
verse: even the most extreme example APM 08279-1-5255, an /i?A5'-selected 
2 : = 3.8 i? ~ 15 red QSO [152] has a dust temperature of about 110 K. 
Although there is a significant contribution to the total luminosity of this 
galaxy shortward of mid-IR wavelengths, from both very hot dust and an 
AGN accretion disk, it is cool dust that makes it shine so brightly at far- 
IR wavelengths. There is an excellent chance that this most extreme object, 
which is magnified by a factor of several tens by an intervening lensing galaxy, 
has hot inner regions that are magnified by a greater amount than the cooler 
outer regions, boosting its optical and near-IR emission [176]. However, this 
would lead to an overestimate of its dust temperature by only about 20% 
[44]. 

It is possible, but far from certain, that submillimetre galaxies correspond 
to forming elliptical galaxies [109,178]. In favour of the argument is the fac- 
tor that only of order 0.1% of the galaxies in the Universe appear to be 
submillimetre luminous, comparable to the fractional density of giant ellip- 
tical galaxies at the present epoch. The inferred luminosities and gas masses 
of the submillimetre galaxies also correspond to the consumption of gas on 
a sub-Cyr timescale at high redshifts, sufficient to generate a homogeneous 
and uniformly old stellar population typical of elliptical galaxies. However, 
it is unclear that elliptical galaxies are as old as their stellar populations, 
and they could have assembled at lower redshifts. It is not at all clear that 
the submillimetre-selected galaxies that have been studied in detail are des- 
tined to become elliptical galaxies. They certainly look destined to have an 
appropriately large stellar mass, but at z = 2.55 and z = 2.81 they are not 
at very great redshifts (see Figs. 21 [154,124,263] and 22 [73,155,157]). The 
well-studied examples are also unusually bright at optical wavelengths, being 
over 6 magnitudes brighter than most submillimetre-selected galaxies [243]. 
They are relatively easy to identify and study at other wavelengths, but may 
not reflect the properties of the other 90% of the population. 

Figure 23 shows a cautionary, and perhaps more realistic, example of an 
identification of an example of the submillimetre population. Without ob- 
taining radio and near-IR images of the field this galaxy would not have 
been identified. It appears to be associated with an ERO, rather than with 
the edge-on spiral galaxy nearby that was considered as an early plausible 
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Fig. 21. The first identified submillimetre galaxy, SMMJ02399-0136 behind the 
cluster Abell 370 [240]. The four images show a variety of multiwaveband data. 
White contours represent flux levels in the saturated parts of images. North is up 
and east to the left. The irregular pair of merging galaxies near the centroid of 
the submillimetre emission, represented by contours in the 30x30-arcsec far-left 
image, was targeted for optical spectroscopy using the 4-m Canada-France-Hawaii 
Telescope (CFHT) in late 1997, a few months after the submillimetre galaxy was 
detected [154], revealed a redshift 2 = 2.80 and a clear AGN spectral signature in 
the brightest westerly component. X-ray observations from the Chandra satellite 
[28] reveal that the galaxy definitely contains an accreting AGN. The detection of 
redshifted CO (3-2) emission at the same position and redshift using the OVRO 
Millimeter Array [124] confirmed absolutely that the counterpart was correct, and 
revealing a large mass of molecular gas. Subsequent spectroscopy from the ESO 
VLT [263] has revealed a halo of Lyman-a emission extending over 13 arcsec. Radio 
emission (black contours in the 30x30-arcsec second panel) from the galaxy provides 
a better location for the submillimetre emission. The Abell 370 field has some of 
the deepest radio data ever taken, and both components of the galaxy are clearly 
detected in the images [158]. The SED of the galaxy is close to that expected from 
the observed far-IR-radio correlation at low redshifts. 10 x 10 arcsec near-IR [K- 
band) and optical (R-band) images are shown in the right-hand frames. The new 
HST ACS camera should soon reveal exquisite resolution images of this galaxy. 



identification, assuming that it contained extensive cool (~ 20 K) dust [241]. 
However, its optical spectrum shows no sign of the vigorous star-formation 
activity from the [Oii]/[Oiii] line ratio or a strong Ha recombination line, 
and so the ERO H5 is a much more compelling candidate. 

7.1 Radio (and Optical) Pre-selection 

We previously discussed the success and otherwise of making connections 
between the populations of distant galaxies detected at optical and submil- 
limetre wavelengths. To first order, there appears to be little connection be- 
tween the populations at the current mutual detection thresholds. If deeper 
images are ultimately available, then any object with the SED of a typical 
low-redshift IRAS galaxy should be detectable at both wavelengths. The mis- 
match between the angular resolution of the first generation of far-IR and 
submillimetre surveys and the exquisite sub-arcsec images flowing from HST 
and ground-based near-IR instruments remains a problem. The total num- 
ber, surface density and limiting luminosity for detection of optically-selected 
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Fig. 22. Another high-redshift (z = 2.56) submillimetre-selected merging galaxy, 
SMMJ1401 1+0252 behind the cluster Abell 1835 [157]. The multiwaveband data 
[243] is presented in the same format as Fig. 21. This time an HST image is avail- 
able, showing a dramatic interacting and distorted red eastern component to the 
galaxy (Jl) ]155]. The galaxy was detected in the redshifted CO(3-2) line using the 
OVRO MMA, absolutely confirming the identification ]73]. The northwesterly blue 
component at the same redshift could be selected in a LBG survey. However, ra- 
dio {black contours, second panel) and millimetre- wave interferometer images show 
that the luminosity of this galaxy at all wavelengths is likely to be dominated by 
emission from the very red region to the north of the fragmented Jl ]157j. 




Fig. 23. A more typical submillimetre galaxy with no redshift and confirmed iden- 
tification. The images are presented in the same format as the preceding figures 
[243]. The ERO close to the centroid of the submillimetre emission is the likely 
association. However, at A ~ 19.1 it is several magnitudes brighter than the bulk 
of submillimetre galaxies. The near-IR A-band data and radio emission is a crucial 
link to the proposed identification: the presence of radio emission reveals that, even 
without considering the submillimetre emission, the ERO is unlikely to be a pas- 
sively evolving elliptical galaxy with red colours due to a quiescent population of 
evolved stars, but is probably due to a strongly reddened young stellar population. 



galaxies still exceeds the equivalent values in the submillimetre by at least 
an order of magnitude: well over a thousand high-redshift optically-selected 
LBGs have detailed spectra [254]. Nevertheless, observations of reprocessed 
dust emission are important to account for the large fraction of energy that 
can be missed in optical surveys, and to find galaxies that fall outside their 
selection criteria. It is important to link together the properties of all high- 
redshift galaxies if we are to understand galaxy formation in full. Some of 
the interesting populations of galaxies that have been defined at other wave- 
lengths but investigated in the submillimetre waveband are listed below. 
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Fig. 24. The history of energy generation in the Universe, parameterized as a star- 
formation rate per unit comoving volume. The absolute normalization of the curves 
depends on the assumed stellar IMF and the fraction of the dust-enshrouded lu- 
minosity of galaxies that is generated by AGN. The points show results derived 
from a large number of optical and near-IR studies, for which detailed references 
can be found elsewhere [57,243]. The most important results are the filled stars 
[177] and high-redshift diagonal crosses [254]. The up-pointing arrow comes from 
a submillimetre-based estimate of star-formation rate from the HDF [150]. An im- 
portant recent measurement of the extinction-free low-redshift star-formation rate 
from radio data from [275] is included. The thick solid and dashed lines represent 
current best fits to far-IR and submillimetre data in a simple luminosity evolution 
model and a hierarchical model of luminous merging galaxies respectively, as up- 
dated to reflect additional data and a currently favored non-zero-T cosmology. The 
thinner solid lines show the approximate envelope of 68% uncertainty in the results 
of the luminosity evolution model. The thin and thick dotted lines represent the 
best-fitting results obtained in the original derivations [56,54]. 



Optically-Selected Galaxies: LEG samples [254] provide thousands of 
spectroscopically confirmed galaxies with luminosities as low as L* at red- 
shifts 2.5 < 2 < 4.5, by detecting the redshifted 91.2-nm hydrogen absorption 
edge between various standard optical observing bands. The selection effects 
that apply to LBGs, faint radio sources and submillimetre galaxies are quite 
different [3]. The key question is whether sufficiently deep LEG surveys can 
always detect faint traces of optical emission escaping from dusty galaxies 
and identify them. This is difficult to decide based on the limited ability 
of existing submillimetre-wave telescopes to probe the faint fluxes expected 
from LBGs. As discussed in Sect. 6.5, while low-luminosity galaxies can prob- 





Andrew W. Blain 



ably always be detected both from stellar and dust emission, at the highest 
luminosities dusty galaxies appear to be more difficult to detect optically, 
and it is certainly very difficult to assign accurate luminosities from optical 
observations alone [139]. 



Near-IR Selected Galaxies: EROs and even ‘Highly ERO (HEROs)’ [259], 
have been detected using large format near-IR cameras. These populations 
are likely to overlap with dust-enshrouded galaxies at some level. At K ~ 
20 most EROs seem to be evolved elliptical galaxies at moderate redshifts 
{z < 1); however, it is likely that the small energetic contribution of redshifted 
optical emission from dusty high-redshift galaxies will be detectable in deeper 
near-IR images [127]. 

7.2 Gamma-Ray Burst (GRB) Host Galaxies 

There is a race to understand and identify the host galaxies of GRBs [98], 
which are typically associated with faint blue galaxies. In some cases. X-ray 
spectra with unusually high iron abundances have been inferred, that could 
be the debris of a high-mass star ejected in powerful winds or in a supernova. 
Red transient emission has also been detected, that would be consistent with 
a supernova light curve in the decaying power-law emission from the cooling 
GRB fireball. These are all potential signs that GRBs are associated with 
the deaths of at least some massive stars. This could provide an obscuration- 
independent route to investigating star-formation in the distant Universe, as 
7 -rays are immune to extinction by dust, and so neither the presence or ab- 
sence of dust in the hosts should bias their detection at y-ray energies, unlike 
far-IR or optical observations of the host itself. So far, several bursts appear 
to be associated with dusty galaxies [36,123], broadly consistent with a pic- 
ture in which a significant fraction of high-redshift, high-mass star formation 
activity is obscured by dust. However, no systematic deep survey of GRB 
hosts has been made to test this view [246,26].^® Much more extensive obser- 
vations are necessary to confirm and quantify the connection. The resolution 
and sensitivity of ALMA should make this an easy task, with minute-long 
exposures being required to image each of the host galaxies of the several 
thousand GRBs that should have been catalogued with accurate positions by 
2010 . 

8 Galaxy Evolution: A Global, Multiwavelength View 

Since there are now many thousands of optically-selected galaxies at red- 
shifts greater than 3, and at least 100 AGN at z > 5 and about 300 dusty 

A program at JCMT with Vicki Barnard and Nial Tanvir to search more system- 
atically for host emission began in acceptable weather in Hawaii the week after 
the Saas Fee meeting [26]. See also Berger et al. [37]. 
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ultraluminous galaxies likely to lie at z > 1, we are starting to build up a 
reasonable census of the contents of the high-redshift Universe (still subject 
to potential surprises!). The challenge to ensure that there are no classes of 
objects missing from the census, and to push deeper to try to see the first 
luminous objects is now a real one. A key question is whether any inferred 
form of the evolution of galaxies shown in Fig. 24 is correct, and to extend the 
results to higher redshifts. There are hints that the opacity of the Universe 
in Lyman-a is rising through unity at z > 6, and so rest-frame UV observa- 
tions will provide a guide to the more distant Universe, unless the galaxies 
ionize large enough surrounding regions to redshift their emission out of the 
absorption range. Rest-frame optical emission can be sought in the difficult 
to observe near-IR waveband, while dusty objects can be observed beyond re- 
ionization at (sub) millimetre wavelengths, and the associated ionized gas can 
be detected from their radio free-free emission (Fig. 10). At even higher red- 
shifts, the dust abundance is expected to decline, yet free-free emission from 
even the earliest galaxies could still be detected using a sufficiently sensitive 
telescope, requiring nothing more than the presence of ionized hydrogen. 

It will be a great challenge to find sources sufficiently bright against which 
to probe the pre-reionization Universe in absorption at these wavelengths; 
however the radio. X-ray and IR continuum emission from GRBs associated 
with the very first stars may offer a powerful, if short-lived and unreproduce- 
able source. 

8.1 Observing First Light 

The next ten years of observational cosmology is full of promise. New powerful 
multi-object spectrographs on 8-10-m class telescopes, including DEIMOS 
on Keck, VIMOS on VLT and CMOS on Gemini are poised to provide new 
very deep redshift surveys to extend the influential results from the 4-m-class 
GFRS redshift survey [177] out to z > 1. With surveys from the forthcoming 
SIRTF IR space mission, these should define the evolution of galaxies and 
their constituent stellar populations accurately out to redshifts z ~ 2. 

At longer wavelengths, ALMA will provide extremely sensitive resolved 
images of dust and molecular gas in metal-enriched galaxies at any redshift, 
by exploiting the favourable submillimetre K correction out to redshifts where 
dust and metals first formed. The time required to generate dust in a first 
generation of stars is very small, corresponding to the lifetime of the most 
massive stars (~ 10® yr), combined with the comparable wind crossing time 
of a star-formation region (assuming 100 pc and 1000 kms“^). If the epoch of 
reionization has now indeed been detected, then perhaps the next key target 
in understanding galaxy formation is to determine the maximum redshift at 
which dust and metal-enriched molecular gas can be detected by ALMA. 

At higher redshifts, only the unobscured restframe visible and UV light 
from the first stars and free-free emission from the ionized gas could be de- 
tected, at mid-IR and radio wavelengths in the observer’s frame. At redshifts 
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in excess of 6 this is a key goal of deep surveys using the JWST. Ground-based 
telescopes are likely to be prevented from achieving the sensitivity required 
at near-IR wavelengths by atmospheric emission, while SIRTF does not have 
a large enough aperture to avoid confusion from the overlapping images of 
less distant galaxies and reach the necessary depths. 

Inverse Compton scattering from CMB photons will limit the lifetime of 
electrons generating synchrotron emission from supernova remnants at very 
high redshifts. The luminosity of radio synchrotron radiation depends on the 
energy density in the magnetic field, while the energy lost from the electron 
population to inverse Compton scattering depends on the energy density of 
all sources of radiation. The tight correlation between the radio and far- 
IR emission from galaxies requires that the ratio of magnetic and radiation 
energy densities is almost constant over 4 orders of magnitude of galaxy 
luminosity at low redshifts [85]. Note that in the most vigorous star-forming 
regions, dust temperatures are of order 100 K, indicating ISRF intensities that 
exceed the CMB energy density at 2 ~ 0 by a factor of (100/2. 73)"^ ~ 1.7x 10®. 
The intensity of the CMB increases as (1 -I- although it is a coincidence 
that this matches the inferred strength of evolution of the luminosity density 
of galaxies at low redshifts. The intensity of the CMB would not match this 
ISRF until a very high redshift ~ 36. In a more reasonable representative 
region of a luminous star- forming galaxy at T = 40K this occurs at up to 
0 ~ 14. In a galaxy like the Milky Way with T = 20 K it occurs at z ~ 6. 

However, the relativistic electrons that generate the radio emission are 
not bound to, and do not necessarily remain near to star-forming regions, 
and so are likely to experience a lower ISRF where the synchrotron emission 
is generated. Given the volume and total luminosity of the Milky Way, the 
energy density in radiation averaged over the whole volume is only about 
2.5 times greater than in the CMB. Hence, equality in the ISRF and CMB 
energy densities will occur at only z ~ 1.5, and beyond z = 2 the CMB will 
dominate the radiation field, effectively quenching synchrotron emission by 
providing much more rapid cooling of electrons. The resulting upscattered 
photons will be in the X-/ 7 -ray regime, but their total luminosity will be 
extremely small and difficult to observe, as the fraction of the bolometric 
luminosity of galaxies radiated in the radio waveband is only of order 10“®, 
less than the standard X-ray emission from compact binaries in star-forming 
galaxies. 

In hotter dusty galaxies, reflecting a more intense ISRF, the domination of 
the energy density by the CMB will not occur until higher redshifts. Since the 
intensity of thermal emission, determined by the intensity of the ISRF, scales 
as and the intensity of the CMB scales as (H-z)^, equality would occur 
at z ~ 5.5 for a 40-K dust source and z ~ 15 for an 80-K dust source. There 
is a small secondary effect, that heating of dust by the CMB imposes a lower 
limit to the dust temperature, but this modification is much less significant 
than the fractional reduction in the radio emission. Note, that even if the 
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CMB quenches synchrotron emission, it will not affect the observability of 
free-free emission from ionized gas at even the highest redshifts. The chance 
of detecting almost flat-spectrum free-free emission from very high redshift 
galaxies using a future centimetre- wave radio telescope is excellent. A galaxy 
with a bolometric luminosity of 10^° Lq is expected to generate a free-free 
flux density of about 1 /xJy at any observed radio frequency in the GHz range 
out to 2 : ~ 20 (Fig. 10). If any such objects are compact, and sufficient angular 
resolution is available to avoid the effects of confusion, then the signs of the 
first ionization by the youngest galaxies in the Universe could be imaged in 
this way. 

9 Future Observations of the Distant Cold Universe 

The opportunities to study distant galaxies in detail at submillimetre wave- 
lengths depends on the availability of instruments with sufficient sensitivity 
and angular resolution to provide detailed astrophysical diagnostics for the 
detected sources. 

The availability of the SCUBA and MAMBO cameras, with fields of view 
of order 5 arcmin made the first surveys of the distant dusty Universe possi- 
ble, while at shorter wavelengths and lower redshifts, ISO provided the first 
glimpse of distant galaxies (Fig. 14). From these galaxy populations, and the 
background radiation intensity derived using the COBE satellite, the surface 
densities of faint dusty galaxies are known within a factor of about 30%, and 
the intensity of the background radiation has been constrained to within a 
similar factor from 2 to 200 /xm. At shorter wavelengths, the background in- 
tensity is constrained to within a decade-wide strip throughout the far- to 
near-IR wavebands. A reasonable picture of the significance and nature of 
the emission of dust radiation from galaxies has thus been developed. 

However, our exploration of the distant Universe at these long wavelengths 
has just begun. Only about 0.3 deg^ of sky has been mapped in submillimetre- 
wave surveys, while the extent of ISO images that are sufficiently deep to 
provide cosmological information is only about 50 arcmin^ . None of these 
images has the resolution to resolve distant galaxies. We will now address 
the issues of sensitivity, resolution and diagnostic power, and instruments 
that are being developed to study the cold Universe in more detail over the 
years ahead. It is important to remember that all these observations will 
require multiwavelength support from radio, optical and X-ray observations 
to maximize their scientific return. 

9.1 Sensitivity and Field of View 

There is no substitute for a telescope with a large total collecting area, to im- 
prove signal-to-noise ratios, even without the advantage of improved diffrac- 
tion limited performance. It is important, however, to limit the thermal emis- 
sion from the telescope, as this can easily be the dominant source of noise at 
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mid-IR and longer wavelengths. This thermal restriction has so far limited 
space-borne apertures to under 1-m. IRAS, ISO, SIRTF and ASTRO-F all 
lie in this range. The first larger aperture to fly in space will be the 3.5-m 
Herschel observatory, at about 70 K. The SOFIA airborne observatory and 
the BLAST balloon-borne instrument will field comparably sized warm, more 
emissive antennas, as they are in contact with the 230-K stratosphere. The 
remnant atmosphere about 15000 m for SOFIA and 40000 m for BLAST will 
also contribute noise. 

In 2-3 years, SOFIA is expected to reach the confusion limit imposed by 
the beamsize and the surface density of galaxies after about 1 week of obser- 
vations staring at the same field, a comparable time to SCUBA at present. 
As a result, it will be suitable for making very narrow-field (arcmin^) deep 
surveys, to probe between the much larger number of galaxies that will be 
detected in wider fields at coarser resolution using SIRTF, and to provide a 
first look at the sub-mJy mid-/far-IR sky. At 2 : ~ 1 the SOFIA confusion 
limit corresponds to the flux density of a IO^^-Lq L* spiral galaxy. To probe 
large numbers of galaxies to these depths will require the greater sensitivity 
of Herschel in space, which should reach the confusion limit in a 200 — /im 
staring observation in about Ihour. With a field of view of order 40arcmin^, 
a Herschel survey of 1 deg^ would take approximately 4 days of constant ob- 
serving. The all-sky (sub)millimetre-wave survey that will be carried out by 
the Planck Surveyor space mission, primarily to image the CMB on scales 
larger than 5 arcmin will provide a sensitive comparison image for these future 
surveys. Planck is currently scheduled to fly alongside Herschel. The ultimate 
sensitivity of Planck at the shortest wavelengths will be limited by the unre- 
solved structure from confused galaxies [52], and it will detect a large number 
(104-5) submillimetre galaxies at least about 10 times brighter than those 
that are being reported from SCUBA surveys. 

The largest existing ground-based submillimetre- wave telescope is the 15- 
m JCMT, and the comparably sized 8 x 6-m SMA interferometer. At millime- 
tre wavelengths the 30-m aperture IRAM telescope at Pico Veleta, and the 
6 X 15-m aperture IRAM PDB interferometer provide the greatest collecting 
areas. The forthcoming CARMA composite array of 8 x 6-m and 6 x 10.4-m 
antennas in the Inyo mountains of California will provide a collecting area ap- 
proximately 70% as large, with the potential for excellent image fidelity from 
a greater number of longer baselines. The 100-m clear-aperture Greenbank 
Telescope (GBT) should be capable of operating at 3 mm during the winter 
in West Virginia, while a 50-m millimetre- wave telescope - the LMT / GTM is 
being constructed near Puebla in Mexico to operate at 1.1 mm. The 64 x 12-m 
ALMA will provide a collecting area comparable to the GBT, in a variety of 
interferometer configurations, with baselines up to 10 km. 

For interferometers the collecting area is the key guide to sensitivity, as- 
suming a reasonable number of several tens of baseline pairs and sensitive 
receivers. However, for single-antenna telescopes the maximum size of the 
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field of view, and the fraction filled with detectors is usually more important. 
For example, current bolometer arrays, SCUBA and MAMBO, are only a 
small factor more sensitive per detector than the single-pixel bolometer de- 
tectors that they superseded. Their advantage comes from imaging several 
tens of beams on the sky simultaneously, and from being able to track and 
subtract fluctuations in atmospheric emission across the whole array. 

The next generation of bolometer cameras will be several times more 
sensitive, and field much larger numbers of detectors. The BOLOCAM and 
SHARC-II cameras are being commissioned at the CSO, using monolithic ar- 
rays of 151 and 384 detectors respectively in the 2.1/1.4/1.1-mm and 350— /rm 
atmospheric transmission windows. BOLOCAM can image the Sunyaev- 
Zeldovich effect from clusters out to any redshift, and detect luminous distant 
dusty galaxies. SHARC-II provides wide-held images of star-forming regions 
in the Milky Way, make follow-up observations and obtain accurate photom- 
etry for distant galaxies detected at other wavelengths, and make dedicated 
pencil-beam deep surveys to determine the deep counts of distant galaxies 
[7]. It is now possible to begin the design process for cameras with many 
thousands of pixels, potentially able to image helds almost 1 deg^ in size on 
a 10-m telescope. A large-format camera such as SCUBA-II being designed 
at the UK-ATC in Edinburgh would be able to reach the confusion limit on 
a 15-m telescope in about 1 hr per 64-arcmin^ held in good weather from the 
4000-m Mauna Kea observing site. A 64 x 64 element array has been dis- 
cussed as a possible detector in this camera. This is possible because of the 
development of very stable, superconducting transition edge sensor (TES) 
bolometers, as demonstrated for the hrst time in May 2001 with the FIBRE 
spectrograph [33]. There are several other highly innovative bolometer de- 
tector developments under way. Developments in technology could make a 
256 X 256 array practicable at the time that Herschel hies and ALMA is 
in service in about 2010, to complement these facilities. With a deep map- 
ping speed of order 1 deg^ per night, submillimetre- wave surveys using such a 
camera could eclipse the existing capabilities of optical and radio telescopes 
to detect raw numbers of galaxies at z > 1. Large-format arrays may even 
allow accurate subtraction of varying atmospheric emission, allowing surveys 
to continue in less-stable daylight conditions. Remember that in the almost 
5 years of submillimetre observations using SCUBA, only about 0.3 deg^ of 
sky has been imaged to a cosmologically useful depth, less than a third of the 
minimum area required to survey a representative region of the high-redshift 
Universe. 

9.2 Resolution 

Future ground-based and air- and space-borne instruments will have the ca- 
pability to survey representative volumes of the distant Universe at long 
wavelengths. However, SIRTF, SOFIA, SCUBA-H and Planck will not have 
the resolving power required to detect substructure in the galaxies they hnd. 
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Studies of the nature of the detected objects will require observations either 
at shorter wavelengths, or using interferometers. To keep pace with the de- 
tection rates, the sensitivity of ALMA will be required. ALMA will be able 
to image a 2-mJy galaxy detected at 850 /im in about Imin, allowing a sys- 
tematic high-resolution programme of follow-up observations. With a 1-mm 
resolution of 20 milliarcsec on a 10-km baseline, ALMA will easily resolve 
200-pc scale structures in high-redshift galaxies. 

At shorter wavelengths, there are several routes to higher-resolution im- 
ages of distant galaxies. A filled-aperture 10-m class space telescope has been 
discussed (SAFIR), which would provide a very sensitive deep sub-arcsec res- 
olution survey at a wavelength of order 10 /rm. For more detailed studies of 
the astrophysics of distant galaxies, resolving individual examples in detail, 
a prototype 10-20-m baseline mid-IR interferometer SPIRIT has been pro- 
posed, as a vehicle for technology development leading to 1-km-scale baseline 
free-flying interferometers like SPECS [191]. 

These instruments are all matched to the complementary high-resolution 
capabilities of the near-IR JWST, and to the extended VLA and SKA radio 
telescopes. For spectroscopic follow-up of the most distant galaxies detected, 
a 30-m class optical telescope, such as the California Extremely Large Tele- 
scope (CELT) concept that is under study will be required. This type of 
facility may already be required to follow up a significant fraction of existing 
submillimetre-selected galaxies in detail. 

9.3 Other Techniques 

A first glimpse of the high-redshift dust-enshrouded Universe has been pro- 
vided by imaging continuum detectors. As described elsewhere in the volume 
[84], vital astrophysical information is provided by observations of submilli- 
metre-wave spectral lines. 

One way to make additional progress, is to combine spectral line infor- 
mation for galaxies selected by their continuum emission. If the output is 
dispersed in frequency, then there is additional protection against the effects 
of confusion noise [63]. At shorter mid- and far-IR wavelengths, fine-structure 
emission lines from low-ionization gas in excited regions of distant galaxies, 
and from the molecular hydrogen rotational and vibrational line emission 
expected to be associated with gas cooling in the first galaxy halos will be 
targets for detection, if wide-band instruments are available. Spectral data 
would allow sources close enough on the sky to have ambiguous positions 
and continuum flux densities to be distinguished, unless they were physically 
associated the confused galaxies are unlikely to lie at the same redshift. This 
will be important if existing samples of submillimetre-selected galaxies are 
observed at the resolution of Herschel. 

Ground-based and space-borne line surveys will provide a valuable com- 
plement to future very deep continuum images. ALMA [272] and SPECS 
[191] already include the possibility of making combined line and continuum 
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surveys, although their potential 10-milliarcsec and sub-milliarcsec resolu- 
tion respectively means that neither has to rely on spectral line observations 
to avoid confusion. It is likely that both innovative spectrographs on even 
relatively small, cold space-borne telescopes, and wide-band CO line survey 
instruments on the ground will allow the distant cold Universe to be explored 
at a drastically increasing rate over the next decade. 

10 Summary 

The goal of this contribution has been to point out some of the special oppor- 
tunities for studying the formation and evolution of galaxies in the wavelength 
range from 10 /rm to about 3 mm, especially using dust continuum radiation. 

This represents one of the last wavebands to be investigated at the rele- 
vant depths for cosmological surveys, and has yielded a haul of unexpectedly 
luminous galaxies. The most luminous QSOs detected at optical wavelengths 
since the early 1960s are matched in luminosity by galaxies that emit the 
great majority of their luminosity at far-IR wavelengths, and at submillime- 
tre wavelengths where they could be detected out to extremely high redshifts. 
As some of the most interesting galaxies in the Universe, they must be ex- 
plained by any complete model of galaxy formation. 

At present only about 0.3 deg^ of sky has been imaged at submillimetre 
wavelengths, detecting several hundred galaxies. These vital first steps were 
possible by the great commitment of time and effort of the teams centred 
at Edinburgh and Bonn that designed and built the SCUBA and MAMBO 
(sub)millimetre-wave cameras for the JCMT and IRAM 30-m telescope. This 
field is only beginning; it should mature rapidly now that larger submillimetre 
cameras are available at CSO (SHARC-II and BOLOCAM), and work is 
underway on LABOCA, a development of MAMBO for the APEX telescope^® 
at the ALMA site in Chile, and the SCUBA-2 camera for the JCMT. At 
shorter wavelengths the SIRTF and ASTRO-F space missions will provide 
much more information about mid- and far-IR galaxies; ASTRO-F is planned 
to produce a much more sensitive revised version of the IRAS all-sky survey. 
The 3.5-m Herschel satellite will subsequently provide better resolution, and 
its fellow traveler Planck Surveyor will provide a sensitive 5-arcmin-resolution 
map of the submillimetre sky, a comparable resource at long wavelengths to 
the IRAS survey that spurred most of the interest in the cold Universe. The 
2.5-m aperture SOFIA airborne observatory and ultralong duration balloon 
payloads, such as BLAST, with comparable warmer apertures, will provide 
opportunities for projects with shorter lead times than space missions, and fill 
various observational niches as they are discovered. Larger colder apertures 
and interferometers in space are planned for the more distant future. 

The detection of larger samples of submillimetre galaxies is an important 
goal, but it is unlikely to provide the answers to the key scientific questions 

http:/ /www.mpifr-bonn.mpg.de/div/mm/apex.htm 
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of their power source, lifetime, spatial extent, masses, metallicities etc. To 
address these questions high-angular resolution imaging, to resolve individual 
galaxies, and multiwaveband spectroscopy, will be required. Hopefully, in 
10 years, this discussion can be repeated after Herschel and ALMA are in 
service, and the true history of cool galaxies can be set out. 
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1 How to Observe the H 2 Component? 

1.1 The H 2 Molecule 

General Properties: H 2 is a symmetrical molecule, and has no electric 
dipole. Consequently, the rotational levels will not be coupled except by the 
quadrupole. The radiative transitions are then quadrupolar, very weak, and 
with selection rule AJ = ±2, which means that the odd and even-J levels 
will be independent. Since the parity of J is related to the orientation of the 
nuclear spins, this results in the existence of two “different” molecules, the 
para-H 2 , in which the nuclear spins of the two protons are anti-aligned, and 
the resulting spin 1 = 0, and the ortho-H 2 , for which the spins of the two 
protons are parallel and / = 1, see Fig. 1. 

Indeed, because of the Pauli exclusion principle, the wave function repre- 
senting the molecule must be antisymmetric. For the para-H 2 , the spin wave 
function is anti-symmetric, and the nuclear wave function must be symmet- 
ric. Since the two electrons are paired in the molecule, their total spin and 
angular momentum is zero. The angular momentum of the molecule is then 
only due to the rotation. The fundamental state is thus J = 0, for which 
there is a symmetric wave-function. For the ortho-H 2 , on the contrary, the 
spin wave function being symmetrical, the nuclear radial wave function must 
be antisymmetric, and the fundamental state of ortho-H 2 is J = 1. 

In addition, H 2 is the lightest molecule, with a very small inertial mo- 
ment, and therefore the energy levels and their separation are very high. The 
energy difference between the ground states of the para and ortho-hydrogen 
is already 170K//c, and the first coupled level is 512K/fc above ground: this 
is the first line, at 28/im, which is possible to observe. This means that the 
H 2 molecule, when very cold at the temperature of usual molecular clouds, 
around 10 — 15 K, will not radiate at all. 

H 2 is the most stable form of hydrogen at low temperature. It is dominant 
in planetary atmospheres. Its main formation process today, in normal molec- 
ular clouds in galactic disks, is on dust grains. However its formation is still 
possible in primordial gas devoid of heavy elements and dust, through ion- 
atom reactions in the gaseous phase, involving H-|-H“ (e.g. Palla et al. 1983). 

Its destruction is through photodissociation by UV photons (Lyman band) . 
It is however shielded by the atom HI, since the photodissociation continuum 
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Fig. 1. Schematic of the first levels of the H 2 molecule. The even-J levels are the 
para-hydrogen, while the odd-J levels are from the ortho-hydrogen. The coupled 
levels obey AJ = ±2, for quadrupolar transitions, and the two species are not 
radiatively coupled. The four hrst rotational lines of hydrogen S(J) (where J is the 
lower state) are represented - wavelengths in /im, S{3) — 9, S{2) = 12, S(l) = 17 
and ^(O) = 28. 



of H 2 starts at 14.7 eV, and its photo-ionization at 15.6 eV (while the HI 
ionization starts at 13.6 eV). 

Self-shielding begins at low column densities, of about 10^°cm“^ in stan- 
dard UV field. H 2 will be therefore be still present, while other molecules 
such as CO would be already photo-dissociated. 

Ortho-para Transitions: Non-reactive collisions cannot induce the ortho- 
para transitions (because of no dipole between the two states), but reactive 
collisions with protons (with an exchange of protons) may induce transitions 
for which AJ is odd. The rate coefficient for the 1 — >■ 0 transition has been 
estimated to be 10“^°cm^/s by Black & Dalgarno (1976). Gerlich (1990) has 
studied in detail the reaction H+ -I-H 2 and confirmed a rate of 2-10“^° cm^/s, 
nearly constant with temperature. At high temperature, the abundance ratio 
should be in favor of the ortho-H 2 , with an o/p ratio of 3, but at the very 
low temperature of the interstellar medium, this ratio should tend to zero 
as exp(— 170/r). However, the equilibrium might not be reached, since the 
formation of H 2 can involve exothermic reactions, and the new H 2 is formed 
in an excited state, or through catalytic recombination of H atoms on the 
surface of grains. There is a large energy of formation, 2.25eV/atom, and the 
formation in the para state is not favored. The transformation 0 -H 2 — >■ P-H 2 
being very slow, it is not likely that the thermal equilibrium is obtained, the 
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more so that the ortho-para coupling via proton-exchange favors the 0 -H 2 
(Gerlich, 1990). Observationally, abnormal ratios are indeed observed with 
the ISO satellite (e.g. Rodriguez-Fernandez et al. 2000). 

1.2 Infrared Lines of H 2 

From the ground, it is possible to detect easily only the vibrationally ex- 
cited lines around 2.2 ^m. These lines involve levels about 2500 K above the 
ground state, and are excited by fluorescence, through high-UV radiation 
near starbursts, or by shock waves, due to violent turbulence. This is the 
case in the ULIRGs galaxies NGG 6240 and Arp 220 for instance, where the 
H2W=1— l'OS'(l) line is significantly brighter than the Bry line, contrary 
to “normal” starbursts (van der Werf 2001). These powerful and numerous 
shocks indicate large gas infall and fueling of these ultra- luminous galaxies. 

As for the pure rotational lines, they require space observations. The ISO 
satellite has detected the first lines S'(O) to S'(3) in many galactic sources, 
and in some external galaxies. In the edge-on galaxy NGG 891, for instance, 
Valentijn & van der Werf (1999) have shown that the two first lines are 
detected quite far from the center of the galaxy, and the radial profile is de- 
clining much slower than the GO luminosity profile (see Fig. 2). Although 
these lines are emitted by warm H 2 , they are a tracer of the more abundant 
cold molecular hydrogen, independent of metallicity, which might reveal pre- 
cious in the outer parts of galaxies. The derived H 2 column densities are much 
larger than that of HI. 




Fig. 2. Major axis profiles in the edge-on galaxy NGC891 of the two fundamental 
rotation lines of H 2 : S(0) {filled circles) and S'(l) {open circles), compared to the 
CO profile {full line)', a) velocity profiles b) integrated line strengths (from Valentijn 
& van der Werf 1999). 
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1.3 UV Lines of H 2 

Electronic lines of H 2 in the UV have very high transition probabilities, and 
therefore are easy to observe. The interstellar gas is detected in absorption in 
front of nearby stars. However, it is possible to probe only low-extinction line 
of sights (Ay < 1.5), and the lines are very quickly saturated. The Copernicus 
satellite was the first in the ’70s to calibrate the H 2 column densities with 
reddening (Savage et al. 1977), and also the H 2 /CO ratio, by observing both 
molecules in UV absorption. 

A wealth of new data is now brought by the FUSE satellite. Column 
densities as low as A^(H 2 ) = 10^^cm“^ are currently detected. Molecular 
hydrogen is ubiquitous in our Galaxy (Shull et al. 2000; Rachford et al. 2001) 
in translucent and diffuse clouds. The molecular fraction in these clouds is of 
the order of a few percents, and the /(H 2 ) fraction is then increasing steeply 
towards dense molecular clouds, entirely molecular. The absorption towards 
the Magellanic Clouds reveals reduced H 2 abundances, due to high UV field 
(Tumlinson et al. 2002, see Fig. 3). H 2 in absorption has also been detected in 
High Velocity Clouds (HVC), while the latter are not detected in CO (Richter 
et al. 2001). 




Fig. 3. FUSE spectrum in front of an LMC star, with a column density of A(H 2 ) = 
cm“^ in J = 0 — 3. On the same line of sight, absorptions are coming from 
the Milky Way (MW) and Large Magellanic Cloud (LMC) and are identified. With 
this technique, only diffuse or translucent clouds can be probed, where the molecular 
fraction is a few percents (from Tumlinson et al. 2002). 
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Table 1. Critical density ricrit for which CuijA^i = 1. 



Molecule 


CO 


NH 3 


CS 


HCN 


[Debye] 


0.1 


1.5 


2.0 


3.0 


Ucrit [cm“®] 


4- 10"^ 


1.1 • 10® 


1.1 • 10® 


1.6- 10 '^ 



1.4 The CO Tracer 

Since the cold molecular hydrogen does not radiate, the best tracer in galaxies 
is the most abundant molecule after H 2 , i.e. the carbon monoxide CO. It has 
a small dipole {n = 0.1 Debye), and therefore requires only mild density to be 
excited in its rotational lines, that fall in the millimeter range. The molecule 
is relatively heavy so that the first level J = 1 is at 5.2 K above the ground 
state, and is therefore perfectly fitted to probe the cold molecular clouds. 
The CO abundance with respect to H 2 is of the order of 10“®. The excita- 
tion is done by collisions with H 2 essentially. The spontaneous de-excitation 
rate A^i is proportional to and therefore a weak dipole indicates that the 
critical density ricrit for excitation, where the collisional rate Cui becomes 
equal to the radiative rate A^i, is small. The competition between collisional 
excitation and radiative transitions, is quantified by the ratio CuijAui which 
is proportional to n(H 2 ) Some values for critical densities rela- 

tive to current molecules are plotted in Table 1. HCN and CS, for instance, 
are called high density tracers, since their emission is detected towards the 
densest parts of the clouds. Various tracers have thus to be used, CO for the 
wide scale more diffuse and extended medium, HCN, CS, for the dense cores, 
etc. . . 

The widely used CO lines (J = 1 — 0 at 2.6 mm, J = 2 — 1 at 1.3 mm) 
are generally optically thick, for “standard” clouds of solar metallicity. This 
complicates the interpretation in terms of H 2 column densities and mass of 
the clouds. More precisely, each individual molecular cloud is optically thick. 
However, a giant molecular cloud is composed of many sub-structures, and 
towards a given external galaxy, there are many clouds along a line of sight. 
The “macroscopic” optical depth is in general not very large, due to velocity 
gradients: clouds do not hide each other at a given velocity, and therefore it 
is possible to “count” the clouds. 

Assuming that there exists a proportionality relation between the ob- 
served CO intensity and the H 2 column density, several calibration procedures 
have been tried: 

• Observing the isotopic molecule ^^CO, the UV lines in absorption. 

• Statistics and counting the “standard” clouds. 

• Using the Virial hypothesis. 
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Isotopic Species: Since the CO lines are optically thick (in average tco ~ 
10), it might be interesting to observe the much less abundant isotope ^^CO. 
At the solar radius, the ^^CO/^^CO abundance ratio is ~ 90, and in indi- 
vidual molecular clouds, it is frequent that the lines are optically thin. 

The observed average ratio between integrated and intensities is 

of the order of 10. 

For diffuse line of sights, it has been possible to calibrate the CO/H2 
ratio, through successive calibrations, knowing and H2 in the solar 

neighbourhood by direct observations of these lines in UV absorption. How- 
ever, the ratio does not apply to dense molecular clouds, representing most 
of the molecular mass in the Galaxy. Also, when maps of the two isotopes 
are compared at millimetric wavelengths for well studied clouds, it appears 
that the does not probe the entire extent of the clouds, but only the 

densest regions. This could be due to problems of excitation, self-shielding, 
or selective photo-dissociation. 



Statistical Count of the Clouds: A more empirical method is to assume 
that the molecular content of external galaxies is composed statistically of 
the same “standard” clouds, of typical masses of 10^ Mq. For usual spatial 
resolutions, there are numerous clouds in the beam, of the order of 10"^ or 
10® clouds. They are not likely to overlap, since they are well separated in 
velocity. The filling factor is the product of the surface factor /g and a velocity 
factor fy and it is assumed that fsfv «C 1. This is the case actually, since the 
antenna temperature observed in nearby galaxies is of the order of TJ ~ 0.1 K 
for nearby galaxies, while it is ~ lOK for an individual typical cloud. This 
method of “counting” the number of clouds in the beam justifies a constant 
factor between the integrated CO intensity I(CO) and N(H2). 



Virial Method: On more physical grounds is the “virial” method. Most 
molecular clouds are gravitationally bound, as has been checked with de- 
tailed studies of individual nearby clouds in the Milky Way. There is a good 
correlation between the virial mass and the CO luminosity, as shown in Fig. 4, 
but the slope is not 1, which means that it is not possible to calibrate the 
conversion ratio for all clouds. If the clouds are not statistically similar from 
one galaxy to the next, the conversion factor could be significantly wrong. 

Assuming statistically that each cloud contributes to the same antenna 
temperature in average, reflecting the excitation temperature of the gas, 
the width of the spectrum AV gives the cloud mass M through the virial 
relation, 

AV^rr^GM, ( 1 ) 

where r is a typical size for the clouds. 

The conversion ratio N(H2)/I(CO) can then be computed as a function 
of average brightness Tr = T^/rj (with 77 the beam efficiency), and average 
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Fig. 4. Virial mass versus CO luminosity relation taken from various sources: 
Dam 86: Dame et al. (1986); Sol 87: Solomon et al. (1987); Heit98: Heithansen et 
al. (1998); MBM85: Magnani et al. (1985); Will 94: Williams et al. (1994); Fal92: 
Falgarone et al. (1992); W 95: Wang et al. (1995); Ward 94: Ward-Thompson et 
al. (1994); Lem 95: Lemme et al. (1995). An indicative line of slope 0.76 is drawn. 




density of clouds n. Let us consider N clouds in the beam, of diameter d, 
with a projected area a = f4:, and a velocity dispersion AV. If the area 

of the beam 6 is written A = Tr{6D)'^ /4, then the integrated CO emission is: 

/(CO) = A-^N—TrAV . (2) 

The mean H 2 surface density is Af(H 2 ) = A~^N{Trd? /A)n, and the conversion 
ratio a: = Af(H 2 )//(CO) = lnd/{TRAV). 

From the Virial hypothesis AV ~ lA^'^d and the conversion ratio varies 
as ITr. This factor is about 2.8 • 10^° cm“^/(km/s) for Tr ~ lOK and 
n ~ 200 cm“^. 

This simple model predicts a low dependence on metallicity for the con- 
version ratio, since the clouds have high optical thickness and are considered 
to have top-hat profiles (no changes of sizes with metallicity). 

However, for deficient galaxies such as the Magellanic Clouds, where 
clouds can be resolved, and the virial method individually applies, the conver- 
sion factor appears highly dependent on metallicity. This can be explained 
considering that the size of CO-emitting clouds (where the optical depth 
r ~ 1) varies strongly with metallicity. 
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Let us assume for instance an isothermal profile for the cloud density (not 
too far from the reality), with p oc then the column density varies as 
A^(H 2 ) (X r~^, and the diameter of clouds, where optical depth becomes unity, 
is varying linearly with the abundance of CO, or in the metallicity Z =(0/H). 
The surface filling factor of clouds varies as , and the conversion ratio varies 
with metallicity more rapidly than linearly. This variation could be even more 
rapid if the C abundance is proportional to oxygen C/0 ~ 0/H in galaxies, 
and CO/H 2 ~ (0/H)2. 

In external galaxies, the ratio between HI and derived H 2 masses M(H 2 )/ 
M(HI) appears to vary indeed as (0/H)^ (Arnault et al. 1988; Taylor et 
al. 1998). On the contrary, in the very center of starbursts galaxies, an over- 
abundance of CO could lead to overestimate the molecular content. The issue 
is not clear nor definite, since there are many varying parameters, with con- 
tradictory resulting effects: Tr is larger in starbursts, but the density n(H 2 ) 
too (as shown by the high-density tracers such as HCN), and fV(H 2 )//(CO) 
varies as the ratio 

In addition, chemical peculiarities have been observed in starbursts. ^^C 
is a primary element, while ^^C is secondary. The isotopic ratio ^^CO/^^CO 
is enhanced, mimicking more optically thin gas. But a more “normal” C^®0 
allows to disentangle the effects (Casoli et al. 1992; Henkel et al. 1993). 



1.5 The Dust Tracer 



Another tracer helps to determine the molecular content in galaxy disks, the 
continuum emission of dust in the millimeter domain. At A = 1mm, the dust 
emission is in the Rayleigh- Jeans domain 






2kT 



(3) 



The flux is quasi-linear in temperature (between 20 and 40K), and in most 
cases, the dust emission is optically thin. The emission is however expected 
to be linearly varying with metallicity Z, and Z decreases exponentially with 
radius. The problem of tracing the molecular component in the outer parts 
of galaxies remain. 

When the molecular component dominates in galaxies, the CO emission 
profile follows the dust profile (for example NGC891, Fig. 5). When the HI 
dominates, on the contrary, the dust does not fall as rapidly as CO with 
radius, but follows a little more the HI, while still falling with radius (for 
example NGC4565, Fig. 5). This reveals that CO might be a poor tracer of 
H 2 , even more dependent on metallicity than the dust emission. 

The excitation effects also combine to metallicity, and might explain why 
CO emission drops more rapidly than dust with radius. 

The rotational ladder of the CO molecule, and in particular the CO(2— 1) 
line tells us about the excitation. For optically thick lines, at sufficient density. 
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X(arcsec) on major axis 

Fig. 5. Dust emission radial profiles in NGC891 and NGG4565, compared to the 
CO and HI radial profiles, from Guelin et al. (1993), and Neininger et al. (1996). 



the CO(2— l)/CO(l— 0) ratio is of the order of 1. For optically thin gas, when 
the two lines have the same excitation temperature Tex, the ratio could reach 
the asymptotic value of 4. 

In most of the disk of galaxies, except towards the very center, the CO 
lines appear subthermally excited. The intensity ratio between the CO (2—1) 
and CO(l— 0) line is in average below 1, of the order of 0.6 (like in the Milky 
Way, Sakamoto et al. 1995). This means that the medium is not dense enough 
to excite the CO molecule to the second level (the more so as the critical 
density is larger for larger J). Even if the kinetic temperature is sufficient 
to excite the upper level of the J = 2 — 1 transition, the density is not, and 
Tex(2 — 1) < Tex(l — 0). Molecular clouds being denser in the nuclei, due to 
larger tidal forces that select only dense clouds, the CO(2— l)/CO(l— 0) ratio 
is there of the order of 1 (Braine & Combes 1992, Fig. 6). 
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Fig. 6. (Left) The CO(2— 1) intensity versus the CO(l— 0) intensity in a sam- 
ple of external galaxies, according their environment, or interacting state. The 
CO(2— l)/CO(l— 0) ratio is very nearly one, whatever the state. {Right) The 
CO(2— l)/CO(l— 0) ratio does not depend on the presence of bars, not on mor- 
phological type (from Braine & Combes 1992). 



1.6 Conclusion 

The cold H 2 molecule is invisible, and unfortunately the molecular clouds 
component, which represents the bulk of the mass of the interstellar medium 
in most spiral galaxies, is difficult to estimate. 

The CO molecule, still our best tracer of H 2 , is not quite reliable, because 
of metallicity effects (non-linear dependence), but also photo-dissociation in 
UV-bright environments, sub-thermal excitation in diffuse media, etc. . . 

It is very important to have other tracers, like mm dust emission, or dense 
core tracers, HCN, HCO'*', and isotopes. The H 2 pure rotational lines, emitted 
by the “warm” H 2 , could be a precious tracer of the colder H 2 , independent 
of metallicity. 

2 Molecular Component in the Milky Way 

2.1 CO Surveys of the Milky Way 

The CO lines being our best tracer of the cold molecular hydrogen, many CO 
surveys of the Milky Way have been reported in the literature; they consist 
of either a high-resolution (small beam, 1’) statistical sampling of the galac- 
tic plane (with small surface mapped), to a more complete, low-resolution 
(large beam 8’), high-coverage surveys, like the Columbia-CfA survey. Most 
of them have been done in the ^^CO(l-O) line, but other were specialized in 
the isotopic line ^^CO(l-O) or the excited line CO (2-1). Here are some of the 
main surveys: 

• CfA-Harvard survey, telescope of 1.2m diameter, beam 9’, sampling 0.12°, 
Northern and Southern hemisphere (Dame et al. 1987, 2001), sky coverage 
of about 50% until |5| < 32°, cf. Fig. 7. 
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Fig. 7. {Top) Velocity integrated CO map of the Milky Way (with mostly 9’ reso- 
lution over most of the map). The main molecular clouds are indicated. (Bottom) 
Longitude-velocity map of CO emission integrated over 4 degrees in latitude cen- 
tered on the Galactic Plane, from Dame et al. (2001). 



• Bell Labs survey, telescope of 7m, beam 1.7’, CO, ^^CO, CS in galactic 
center (Bally et al. 1987, 1988). 

• NRAO-Kitt Peak survey, telescope of 12 m, beam 60”, fraction of sky 
covered 10“^ (Burton & Gordon 1978; Solomon et al. 1979). 

• Massachusetts-Stony-Brook survey, FCRAO telescope of 14m, beam 45”, 
sky coverage 10“^ (Sanders et al. 1986). 

• CO(2-l) survey with Nobeyama 60 cm telescope, beam 9’, Sakamoto et 
al. (1995); the average CO(2-1)/CO(1-0) ratio found for the Galaxy is 
0 . 66 . 

• surveys also made in Bordeaux (beam 4.4’, Jacq et al. 1988), 
Columbia (9’ Bronfman et al. 1988), etc. 



2.2 CO Distribution and Spiral Structure of the Milky Way 

Since we are located in the plane of the Galaxy itself, the derivation of dis- 
tances along a given line of sight is complex. The main method to obtain dis- 
tances is kinematical, from the observed radial velocity of the emission, with 
the help of a model for the rotation of the Milky Way. It is assumed in general 
that the rotation is circular, and streaming motions are ignored, which could 
alter significantly the derived 3D- view of the Galaxy (e.g. Combes 1991). 

The rotation curve is first obtained from the observation of terminal ve- 
locities, at the points where the circular velocity is parallel to the line of 
sight (see Fig. 8). The locus of these tangent points is a circle of diameter 
the segment Sun-Galactic Center. 

For each line of sight of longitudes I lower than 90°, there are two points 
with the same projected radial velocity, but one at a near distance and the 
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R(kpc) r(kpc) 

Fig. 8. (Top) Schematic diagram showing that two points on the line of sight 
may have the same projected radial velocity with respect to the observer. There 
is ambiguity of kinematic distances between a near point (inside the circle) and 
a far point (outside). The circle is the locus of points where the circular velocity 
is exactly parallel to the line of sight, and the observed radial velocity maximum 
(terminal velocity of tangent points). {Bottom left) Rotation curve of the Milky 
Way: the stars are data points (cf. Fich & Tremaine 1991), and the curve a model 
fitted to the data. {Bottom right) The observed radial velocity as a function to the 
distance to the observer, for a longitude of 45° where there is distance ambiguity, 
and for I — 135° where there is not. 



other outside the locus of tangent points. The observed radial velocity as a 
function of the longitude I and the distance to the point R is (Fig. 8): 

Vrad(f, 1) = Rq sin I {n{R) - Hq) , (4) 

where 17 is the angular rotational velocity. To remove the degeneracy, the 
latitude or height above the plane can play a role statistically, the nearest 
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points appearing at a higher latitude. Also the distance of close-by stars, 
determined by their spectrum or by absorption (in front or behind the clouds) 
may be used to raise the distance ambiguity. 



Radial Distribution: Once the distance of the emitting material is known, 
it is easy to invert the observed longitude-latitude map and find the galacto- 
centric ppmap, and the radial distribution in the Galaxy. One has to remind 
that all these steps are model dependent, and in particular the radial dis- 
tribution and the spiral structure are considerably perturbed by the ignored 
streaming motions. 

The most striking features of the radial distribution are (see Fig. 9): 

• the large concentration in the center (inside 1 kpc in radius, corresponding 
to the decoupled circumnuclear disk), 

• the relative depletion around 2 kpc, 

• the Galactic molecular ring between 4 and 8 kpc, which concentrates most 
of the disk emission. This ring might be decomposed in spiral arms, but 
the problem is to disentangle the actual distances, in the spiral arm region 
where streaming motions are important, 

• the quasi exponential radial decrease afterward. 





Fig. 9 . (Left) Radial distribution of H2 surface density derived from CO emission, in 
the southern and northern hemisphere, from Bronfman et al. (1988). (Right) Com- 
parison between the HI and H2 derived surface densities, as a function of galactic 
radius. The data inside Rq = 8.5 kpc are from a compilation of the various surveys 
presented here (see Burton 1992), and outside the solar circle, from Wouterloot et 
al. (1990). 
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Fig. 10. An N-body+SPH simulation from Fux (1999) reproducing most of the 
observed features in the region |Z| < 35°. (Left) Face-on projection of the gas 
distribution, with the location of the observer indicated by the © symbol. The 
bar inclination angle is 25° and the corotation radius 0.55 relative to the Sun’s 
galactocentric distance. (Right) l — V diagram based on the gaseous particles within 
|b| < 2°. 



Spiral Structure: To determine the overall spiral structure of our Galaxy 
is a difficult task, although there is evidence of strong and contrasted spiral 
arms in the I — V diagram, for instance: there are strongly depleted region 
and a large contrast, and this cannot occur in a smooth axisymmetric disk 
(see Fig. 7). 

There is also evidence of a barred structure inside the solar circle, through 
the highly non-circular motions. If the motions are assumed only circular, 
these elliptical streamlines are interpreted as radial expansion due to explo- 
sions (for a long time, peculiar features were dubbed “expanding” , such as 
the expanding molecular ring in the center, or the expanding 3kpc arm). 

Since it is too model dependent to deproject the observed I — b—V data 
into a face-on view of the Milky Way, the best tool to better understand its 
structure is to project realistic simulations in the I — V diagram. Satisfying 
models have been published by Mulder & Liem (1986) or Fux (1999), as 
shown in Fig. 10. 

2.3 The Inner Galaxy 

The center of the Galaxy has always been a big puzzle: large forbidden veloc- 
ities are observed; they do not fall to zero towards zero longitude, as expected 
for a disk in pure circular motion. Guriously, although the hypothesis of a bar 
potential had been proposed (Peters 1975) and was realistic, since more than 
two thirds of spiral galaxies are barred, the explosion and expansion model 
was more discussed (Oort 1977; Sanders & Bania 1976). 

Now the bar has been directly seen in the near-infrared image of GOBE- 
DIRBE, through a perspective effect: the near-side of the bar being signifi- 
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Fig. 11. Molecular gas in the inner Galactic disk, showing the circumnuclear disk: 
(Left) I — V diagram in *^^00, integrated over latitudes |fe| < 0.6°, from Bally et 
al. (1988). Note the “expanding molecular ring or EMR”, Clumps 1 and 2, and 
3kpc arm features. The main lane at F = 0 is coming from the molecular ring in 
the Galactic disk, farther from the Galactic center. (Right) I — V diagram in ^^CO, 
integrated over latitudes |6| < 0.1°, from Bally et al. (1987). The central nuclear 
disk forms a characteristic “parallelogram” feature. 



cantly less distant from us than the more remote side, its height above the 
plane is larger (Dwek et al. 1995). 

Moreover, the characteristic features in the l — V diagram, the non-circular 
motions giving rise to a peculiar parallelogram geometry (Fig. 11) can be 
explained by the shape of the periodic orbits in a bar potential: the x\ orbits, 
parallel to the bar, are highly elongated, and may also reveal a cuspy shape 
at their extremities. Inside the parallelogram, the diagonal feature could be 
due to the X 2 orbits, that are perpendicular to the main bar, and of nearly 
circular shape (Binney et al. 1991). Taking into account the perspective effect 
can also explain the apparent offset for the center of the parallelogram (Fux 
1999, Fig. 12). The simulations show also that the center of the bar may 
wander around the (l,b) = (0,0) position (Fux 1999), and part of the offset 
could be intrinsic, and non-stationary. The 3 kpc “expanding” arm is one of 
the conspicuous arm around the bar. 

If the parallelogram is interpreted as the leading dust-lanes characteristic 
of barred galaxies, the high velocity Clumps 1 and 2 or velocity-elongated 
features at low longitudes in the I — V diagram (Fig. 11; Bania et al. 1986, 
Stark & Bania 1986) are gas lumps crossing the dust-lane shocks. The bar is 
inclined with respect to our line of sight to the Galactic center by 25°, the 
corotation radius is at 4.5 kpc, which is also the approximate radius of the 
bar (Weinberg 1992; Fux 1999). Other features remain to be interpreted such 
as the inclination of the nuclear disk with respect to the main plane, and the 
lopsidedness. 

The very large observed velocities in the circumnuclear disk, together with 
the relative depletion just outside (at about 2 kpc radius) strongly suggest 
that the nuclear disk is decoupled from the main disk, as is frequently the 






120 



F. Combes 




K Uiwl 




4H 30 0 -ao -'O 

1 IdcgJ 



Fig. 12. The interpretation of the parallelogram feature in the central Galaxy, 
from the N-body+SPH simulation by Fux (1999). (Left) Face-on view of the gas 
distribntion, with the identification of some xi and X 2 orbits. [Right) l — V diagram 
based on the gaseons particles within |fe| < 2°, and reproducing the same orbits. 



case in external galaxies (Rubin et al. 1997; Emsellem et al. 2001). It is even 
likely that a secondary bar is rotating at a different pattern speed from the 
primary bar, and is dynamically decoupled. Such a nuclear bar has been seen 
in the near-infrared image from 2MASS by Alard (2001). 



Nature of Molecular Clouds in the Inner Galaxy: The giant molec- 
ular clouds of the circumnuclear disk near the center have distinct physical 
parameters from those of the galactic disk. They are denser, by two orders 
of magnitude (l^ lO'^cm”^) as revealed by high density tracers such as HCN 
or CS. This can be explained by the fact that tidal forces from the galaxy 
potential are larger towards the center. Lower density clouds would have been 
destroyed by these forces, and are not present. 

To have an order of magnitude of the variation with galactic radius R of 
tidal forces, let us differentiate the gravitational radial acceleration jR\ if 
we can make the approximation of a flat rotation curve V ~ const, then the 
tidal force on a cloud of diameter d is: 







( 5 ) 



The limiting density for a cloud of mass Me is reached when this tidal force 
is equal to the internal gravity force ~ AGMc/d^ . The minimum density of 
clouds is thus: 



_ 3 

“ 4ttGW 



103cm-3(^2^^ 



R 



2 



(6) 
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Below this critical density, clouds are sheared off to the diffuse medium. In 
addition to a larger density, clouds have also a higher velocity dispersion in 
the center, due to the Toomre criterion for overall gravitational stability. 

2.4 Vertical Distribution 

The vertical distribution of molecular clouds together with their vertical ve- 
locity dispersion, is a good diagnostics of the dynamics of the clouds. Since 
we are inside the plane, these quantities are somewhat uncertain. It is clear 
from the (l,b) map that the molecular plane is much thinner than the HI 
plane. However there are projection effects for the local gas, and the warp in 
the outer plane for the HI. The vertical velocity dispersion CTg is obtained in 
the Milky Way, at the tangent points (Malhotra et al. 1994, 1995), where the 
crowding of matter allows a better statistics. In external galaxies, it is more 
easily obtained in face-on objects, while the actual thickness hg is observed 
in edge-on systems. 



Vertical Equilibrium: In the hypothesis of a thin isothermal infinite disk, 
completely self-gravitating, the expected thickness hg is related to the velocity 
dispersion ag and the surface density /ig by: 



hg{r) 



27T^g(r) 



( 7 ) 



and the density profile is then a sech^ law. 

If the gas clouds are considered as test particles in a potential of larger 
scale height KzZ, the density has then a Gaussian profile. 



Pg = po exp 




(8) 



with a characteristic height. 



with Kz = 4:TrGpt {pt being the total density). 

Observed in the HI gas, the velocity dispersion is constant with radius, 
and equal to lOkm/s (12km/s in the center) (Dickey et al. 1990). This is 
best seen in face-on galaxies. In the molecular component, also <Tg = const, 
in the face-on galaxies NGC628 and NGG3938. The surprising observation 
is that both dispersion (atomic and molecular) are about equal (Gombes 
& Becquaert 1997). This is not compatible with two different independent 
components with different thicknesses (about 60 and 220 pc for the H 2 and 
HI respectively). This result is obtained after subtraction of the expected 
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Fig. 13. The scale-height (half width at half maximum) of the atomic and molecular 
components as a function of galactic radius, revealing the flaring of the galactic 
plane, from Wouterloot et al. (1990). 



linewidth due to the systematic gradient (rotation), and appears the same 
when the isotopic molecule is considered. 

In the Milky Way, the modelling of hg and Cg at the tangential points 
(Malhotra 1994) allows to deduce the azimuthal velocity dispersion. Again, 
no variation with radius is detected (except towards the galactic center), 
which should help to identify the heating processes (star formation, gravita- 
tional instabilities, . . . ). As for the dispersion value, there is some scatter in 
the literature, from 4 to 11 km/s, according to which scale is considered, and 
whether dumpiness of molecular clouds is taken into account or not. In aver- 
age, in the Milky Way, the dispersion is of 8km/s, averaged over 200 — 400 pc, 
and the scale-height of 75 pc in the molecular ring. The molecular plane is 
flaring, and the scale-height is slightly increasing with radius, at about the 
same rate as the HI component (Fig. 13). 

The shape of the vertical density law is not Gaussian, as expected from a 
non-self gravitating medium, but tails of small clouds are observed (Magnani, 
Blitz, Mundy 1985). The centroid of the plane departs from z = 0 more than 
the scale height, due to corrugations and warp. 

The dust layer is slightly thicker than the molecular component, but much 
thinner than the HI plane (Sodroski et al. 1987). 



Phase Transition Between HI and H2: If the velocity dispersions of 
atomic and molecular gas are comparable, this might be explained by a model 
in which both are two different phases of the same gas component, sharing 
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the same dynamics. CO is observed more concentrated to the plane, since 
the gas is becoming molecular above a certain critical density, that is reached 
only below a certain height above the plane. Molecules at z = 0 starts with 
the same velocity and will reach the same maximum height above the plane, 
but have more chances to end their orbit in an atomic phase. 

The phase transition appears rather sudden, depending on pressure, UV 
radiation, density (Imamura & Sofue 1997). The HI thickness of 200 pc is not 
explained through the turbulent velocity (cTg = 9km/s), but HI needs extra 
support to keep its height (Malhotra 1995). 

From the constancy with radius of the gas velocity dispersion, and the 
observed linear flaring, the surface density of the plane can be deduced: and 
exponential law is compatible with the data, which means a constant mass- 
to- light ratio (since the stellar density is exponential). 



Origin of the Heating: Star formation is contributing significantly in the 
center of the optical disk, which explains the increase of gas velocity disper- 
sion there. 

Elsewhere, in quiescent areas, self-regulated gravitational instabilities, fol- 
lowing the Toomre criterion for disk stability, must be the main heating 
source. 



Flaring of the Plane: The HI thickness increases linearly with radius R, 
hg = ho + 0.045i?, from Merrifield (1992). This is not only visible in HI, but 
also in the molecular plane (see Fig. 13). The flaring is due to the decrease of 
the total density in the plane, and the corresponding decrease of the restoring 
force. 

The CO (and therefore H 2 ) follows the flare, and also the warp Grabelsky 
et al. (1987). 

Warping of the Plane: The warping of the plane is spectacular in HI, 
it is mostly on one side, and asymmetrical. Corrugations are observed with 
amplitude larger than the scale-height hg. The CO follows the warp, and the 
molecular component appears even more warped, being thinner. 

This phenomenon is also observed in external galaxies, in particular M31, 
where CO is detected in some positions with 2 velocities, at each crossing of 
the warped plane. These velocities are compatible with the position-velocity 
models from Henderson (1979), and the characteristic flgure-8 shape (see 
Brinks & Burton 1984). 



High Velocity Clouds: High Velocity Clouds (HVC) are detected essen- 
tially in HI, no CO emission has been detected until now. The origin of 
HVC might be multiple, a large fraction of them belonging to the Magellanic 
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Stream, of low metallicity. Some H 2 has been detected in HVC through UV 
absorption lines (Richter et al. 2001; Tumlinson et al. 2002). 

The non-detection of CO emission is likely due to the low metallicity of 
this gas, as confirmed by Wakker et al. (1999). Whatever its origin, the infall 
of gas at Z = 0.1 solar is required to explain the chemical evolution of the 
Galaxy, and in particular the G-dwarf problem (for which an infall rate of 
1 Mq is necessary). 

2.5 Gamma-Ray Surveys 

In the Milky Way, the emission of gamma-rays of high energy {E > 100 Mev) 
is a tracer of all interstellar matter, whatever its phase. Nucleons (HI, H 2 , 
HII, . . . ) interact with cosmic rays to produce pions, that disintegrate in 
gamma-rays. Early surveys showed that the CO/H 2 conversion ratio must 
not be constant throughout the Galaxy (Wolfendale et al. 1977). 

The distribution in the Galaxy of high-energy gamma-rays extends radi- 
ally much more than what is expected from their sources (cosmic rays are 
accelerated by supernovae), and more than the gas itself (Bloemen 1989). 
Cosmic rays are thought to diffuse in the halo, and their distribution is ob- 
tained through gamma-rays themselves, assuming the gas distribution known 
(Strong et al. 1988). 

Today, the lack of gamma-rays in the center is confirmed by EGRET on 
GRO satellite. In addition, an excess towards high latitude has been detected, 
as a halo of gamma-rays above the plane (Dixon et al. 1998). Several inter- 
pretations have been proposed, either in terms of an excess of nucleons in the 
halo (de Paolis et al. 1999) or inverse-Compton emission (Strong & Mattox 
1996; Strong et al. 1999). The latter is supported by the evolution of the 
gamma-ray spectrum from the inner galaxy to high latitudes. 

2.6 Conclusion 

The interstellar medium of the Milky Way is composed of comparable amounts 
of H 2 and HI gas, with an estimated molecular mass of M(H 2 ) ~ 2— 3T0® Mq. 
However the atomic and molecular components have very different radial 
repartitions. H 2 is centrally concentrated, in a circumnuclear disk, and after 
a relative hole, is then confined in a molecular ring between 4 and 8 kpc. HI 
is rather depleted in the center, and much more radially extended. 

The molecular component is much more clumpy, and composed of a hi- 
erarchical cloudy structure, up to giant clouds of 10® Mq in mass. The H 2 
plane is thinner than the HI plane, but the velocity dispersions of both gas 
are comparable. The flaring and the warp are shared by the two components. 
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3 The Fractal Structure of the Molecular Gas 

3.1 Self-similarity of Clouds 

The molecular interstellar medium is very clumpy and fragmented. Its hi- 
erarchical structure can well be described by a fractal, because of its self- 
similarity. It has no characteristic scale. Fractals by definition are self-similar 
ensembles, that have a non-integer, i.e. fractional dimension (Mandelbrot 
1975). The Hausdorff dimension D determines whether a system is homoge- 
neous, and what fraction of space is filled. While a homogeneous system has 
a mass increasing as the 3'''^ power of the scale r (in three dimensions), a 
fractal medium may occupy a tiny fraction of space, and its mass is growing 
as M oc r^, with D lower than 3 (D Ri 1.7 for the interstellar medium). 

The ensemble of molecular clouds, as many fractals considered in physics, 
are only approximations of mathematical fractals. They are self-similar only 
between two limiting scales, where boundary effects occur, while a pure math- 
ematical fractal is infinite; and they are quite randomly distributed, their 
self-similarity being only statistical (see Fig. 14). 

The self-similar structures in the interstellar medium extends over 6 or- 
ders of magnitude in scale, from about 10“^ to 100 pc, and these are not 
observed for the same molecular cloud only because of technical problems, 
lack of spatial resolution on one side, and difficulty of mapping too large areas 
on the other. The scaling relations all over the scales are however obtained 
by comparing various clouds observed with different resolutions, using the 
dynamics of distances in the Galaxy (from 50 pc to 20 kpc) . 




Fig. 14. (Left) IRAS 100 /xm map of molecular clouds towards the Taurus complex, 
located at about 100 pc from the Sun. The far-infrared emission is from heated dust. 
The square is ~ 4000 pc^. (Right) Zoom of the central region (the square is now 
~ 400 pc^). 
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3.2 Limits of the Fractal 

The largest self-gravitating entities in the Galaxy are the Giant Molecular 
Glouds (GMG) of about 100 pc in diameter, and 10® Mq in mass. Larger 
clouds cannot exist since they would be teared off by the galactic shear, i.e., 
the tidal forces due to the galactic potential itself. This is the high cut-off 
scale in the fractal structure. 

As for the small-scale cut-off, it is difficult to observe directly in emission 
the smallest structure, due to lack of spatial resolution and sensitivity. But 
structures of about 10 — 20 AU in size (i.e., ~ 10“'^ pc) have been observed for 
a long time through scattering of the quasar light (Fiedler et al. 1987, 1994; 
Fey et al. 1996) and also in front of pulsars (Frail et al. 1994): clumps in the 
electronic density diffract the light rays from remote quasars, and produce 
an “extreme scattering event” (ESE) lasting for a few months, in their rapid 
motion (100 — 200km/s) just in front of the quasar. QSOs monitoring during 
several years has determined that the number of scattering structures is 10® 
times as numerous as stars in the Galaxy. The problem of stability and life- 
time of these structures, with much higher pressure than surroundings, has 
long been debated. Heiles (1997) proposes that these TSAS (tiny scale atomic 
structures) are cold filaments or sheets, that appears dense in projection. The 
problem can be solved if they are self-gravitating (Walker & Wardle 1998); 
they are then of 10~® Mq in mass, and have a gas density around 10^® cm“®. 
They correspond to the smallest fragments predicted theoretically (Pfenniger 
et al. 1994, Pfenniger & Gombes 1994). These structures are now observed in 
a large number directly, through VLBI in the solar neighborhood, through HI 
absorption in front of quasars (e.g.. Diamond et al. 1989; Davis et al. 1996; 
Faison et al. 1998). If this 10 — 20 AU size is adopted for the low cut-off scale 
of the fractal structure, the latter ranges over 6 orders of magnitude in size, 
and about 10 in masses. 

3.3 Scaling Laws 

Power-law relations between size, linewidth and mass quantify the self-similar 
structure (cf. Larson 1981). These scaling relations are observed whatever 
the tracer. The original one is the size-linewidth relation (directly derived 
quantities), while the mass is only a secondary quantity, very uncertain to 
obtain, since there is no good universal tracer. The relation between the sizes 
R and the line-widths or velocity dispersion cr, can be expressed through the 
power-law: 

afxR\ ( 10 ) 

with q between 0.3 and 0.5 (e.g., Larson 1981; Scalo 1985; Solomon et al. 1987; 
cf. Fig. 15). There are evidences that molecular clouds are virialised at least 
at large scale; it is more debated at small scales, where there is no good tracer 
of the molecular mass (in particular the conversion factor between GO and 
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Fig. 15. {Top) Size-linewidth relation taken from various sources: Dam 86: Dame et 
al. (1986); Sol 87: Solomon et al. (1987); Heit 98: Heithausen et al. (1998); MBM85: 
Magnani et al. (1985); Will 94: Williams et al. (1994); Fal92: Falgarone et al. (1992); 
W 95: Wang et al. (1995); Ward 94: Ward-Thompson et al. (1994); Lem 95: Lemme 
et al. (1995). An indicative line of slope 0.5 is drawn. (Bottom) Mass-size relation 
deduced from the previous one, assnming that the strnctures are virialised. The 
line drawn has a slope of 2. 



H 2 mass could be higher, and the mass at small-scale underestimated). If the 
virial is assumed at all scales, then: 



2 M 



and the size-mass relation follows: 



( 11 ) 



M (xR^ , 



(12) 



with D the Hausdorff fractal dimension between 1.6 and 2. The mean density 
over a given scale R decreases thus as where a is between 1 and 1.4. 

High Latitude Clouds (Heithausen 1996; Magnani et al. 1985) also follow 
these relations. 

When considering the estimation of the fractal dimension, it has to be 
taken with caution, because of the observational biases. The main tracer 
used is the CO molecule, which is optically thick. Radiative transfer in a 
fractal is complex, and the peculiar systematic effects are unknown. Some 
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simple simulations of fractals show already large trends (Pfenniger & Combes 
1994). 

The fractal structure, and its self-similarity, is broken in regions of star 
formation. The break is observed as a change of slope in mass spectrum power 
laws at about 0.05 pc in scale in the Taurus cloud, corresponding to the local 
Jeans length (Larson 1995). In other regions, either this break does not occur, 
or is occurring at larger scales (Blitz & Williams 1997; Goodman et al. 1998). 

The observations of molecular clouds reveal that the structure is highly 
hierarchical, smaller clumps being embedded within the larger ones. This 
structure must be reminiscent of the formation mechanism, through recursive 
Jeans instability for instance. Since we have no real 3D picture, it is however 
difficult to ascertain a complete hierarchy, or to determine the importance of 
isolated clumps and/or a diffuse intercloud medium. An indicator of the 3'^'^ 
dimension is the observed radial velocity, which is turbulent without system- 
atic pattern. It has been possible, however, to build a tree structure where 
each clump has a parent for instance for clouds in Taurus (Houlahan & Scalo 
1992). 



Filamentary Structure: The scaling relations do not characterize com- 
pletely the cloud ensemble, and for the same fractal dimension, many ge- 
ometries are possible. Molecular clouds have a filamentary structure, with 
aspect ratio close to 10 or larger. About 30% of the mass is contained in 
the filaments, which appear like abrupt transitions, like shock waves; their 
confinement could be due to gravity. 

To characterize the self-similar structures in projection, several authors 
have computed the relation between the surface and the perimeter of these 
structures. This method has been used in observed 2D maps, like the IRAS 
continuum flux, or the extinctions maps of the sky. In all cases, this method 
converge towards a fractal dimension D 2 , i.e., the perimeter P and area A 
are related by 

(13) 

Falgarone et al. (1991) find a dimension D 2 = 1.36 for CO contours both at 
very large (degrees) and very small scales (arcmin), and the same is found 
for IRAS 100 /X contours in many circumstances (e.g., Bazell & Desert 1988). 
Comparable dimensions {D 2 between 1.3 and 1.5) are found with any tracer, 
for instance HI clouds (Vogelaar & Wakker 1994). 

The projection of a fractal of dimension D may not be a fractal, but if 
it is one with dimension Dp it is impossible a priori to deduce its fractal 
dimension, except that (Falconer 1990): 

Dp = D if D <2 , 

Dp = 2 if D>2 . 



(14) 

(15) 
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The mass spectrum of interstellar clouds is another information, charac- 
terizing further the cloud ensemble. It has been estimated as: 



dm 



(16) 



with 7 ^ 1.6 — 1.9 (Heithausen et al. 1996, 1998). 

This power law is not related to the fractal dimension, but to the ge- 
ometry of the fractal, and its hierarchical character. If the fractal is entirely 
hierarchical, the power-law expected is 2, since mN(m) = const, or 



n{m)dlogm(xm ^ dlogm , 
and all the mass is included in the smallest fragments. 



(17) 



Formation by Recursive Jeans Fragmentation: The hierarchical fractal 
structure yielded by recursive fragmentation can be simulated schematically, 
in order to estimate the resulting filling factor, and the biases introduced by 
observing with a limited spatial resolution (e.g., Pfenniger & Combes 1994). 
Clouds are distributed according to the radial distribution of an isothermal 
sphere in and fragments also have positions selected randomly accord- 
ing to this radial distribution, and so on, hierarchically. The number N of 
fragments at each level, and the fractal dimension D chosen are sufficient to 
determine the ensemble. The recursive relations between two successive scales 
L and L — 1 are Ml = NMl-i, = Nr^_^ and the size ratio between to 
imbricated spheres is a = tl-i/tl = . A sample of these fractal pro- 

jections is shown in Fig. 16, for two dimensions 2.2 and 1.6. Typically fV = 10 
and L = 9. 

The resulting surface filling factor is a strong function of D, and is less 
than 1% at D = 1.7. This low filling fator will lead to mass underestimate 
when observed with low spatial resolution. The large range of scales is also a 
source of bias, and of overestimates of the fractal dimension: the small scales 
are not resolved, and observed maps are smoothed out. This process, which 
makes the fractal look as a more diffuse medium of larger fractal dimension, 
can also lead to underestimates of the mass by factors more than 10 (e.g., 
simulations in Pfenniger & Combes 1994, and Fig. 16). 

3.4 Turbulence 

The interstellar medium is highly turbulent, its Reynolds number Rg = vl/v, 
where v is the velocity, I a typical dimension, and v the kinematic viscosity, 
is of the order of ~ 10®, because of its very low viscosity. The advection term 
V . V V dominates the viscous term in the fluid equation. 

In a first approach, the interstellar structures may be interpreted in terms 
of incompressible turbulence. Indeed the observed size-linewidth relation is 
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Fig. 16. {Left top) Simulations of a fractal of dimension D — 1.6 built by recursive 
fragmentation. {Left bottom) Same for a fractal of dimension D = 2.2, the number of 
points are similar in the two cases. {Right) Cumulated mass as a function of surface 
density S in various simulated models with different fractal dimensions. Are plotted 
the ratios (in log-scale) of what would be obtained for an homogeneous medium 
with D = 3 to the model with D = 2.5 {dots), D — 2 {dash) and D = 1.5 {long 
dash). This shows that estimated masses can be wrong by 2 orders of magnitude 
(from Pfenniger & Combes 1994). 



very similar to what is expected in the Kolmogorov cascade (Kolmogorov 
1941). In this picture, energy is dissipated into heat only at the lower scales, 
while it is injected only at large scale, and transferred all along the hierarchy 
of scales. Writing that the energy transfer rate v‘^/{r/v) is constant gives the 
relation 

V oc (^8) 

which is close to the observed scaling law, at least for the smallest cores 
(Myers 1983). The source of energy at large scale could be the differential 
galactic rotation and shear (Fleck 1981). This idealized view has been debated 
(e.g., Scalo 1987): it is not obvious that the energy cascades down without 
any dissipation in route (or injection), given the large-scale shocks, flows, 
winds, etc. observed in the ISM. The medium is highly supersonic (with 
Mach numbers larger then 10 in general), and therefore very dissipative. 
Energy could be provided at intermediate scale by stellar formation (bipolar 






Molecules in Galaxies at All Redshifts 



131 



flows, stellar winds, ionization fronts, supernovae, . . . ). Also, the interstellar 
medium is highly compressible, and its behaviour could be quite different 
from ordinary liquids in laboratory. However a modified notion of cascade 
could still be applied, leading to a Burgers spectrum, with 

V (X (19) 

(see e.g. Vazquez-Semadeni et al. 1999). 

Besides, many features of ordinary turbulence are present in the ISM. 
For instance, Falgarone et al. (1991) have pointed out that the existence of 
non-Gaussian wings in molecular line profiles might be the signature of the 
intermittency of the velocity held in turbulent flows. Also the curves obtained 
through 2D slicing of turbulent flows have the same fractal properties as the 
2D projected images of the ISM; their fractal dimension D 2 obtained from 
the perimeter-area relation is also 1.36 (Sreenivasan & Meneveau 1986). 



3.5 Numerical Simulations of Turbulence 



The magnetic held may play an important role at small scale, through energy 
exchange between the various degrees of freedom, although it cannot prevent 
cloud collapse, and MHD simulations have been considered. The intensity 
of the held has been measured through Zeeman line effects to be around 
a few /iG in dense clouds (Troland et al. 1996; Grutcher et al. 1999), and 
the observations are compatible with the hypothesis of equipartition between 
magnetic and kinetic energy. 

A large number of hydro and MHD simulations have been run, however, 
they suffer in general of too low a dynamical range, due to huge computa- 
tional requirements. To gain in spatial resolution, 2D or even less (because 
of symmetries) computations are performed, but often the results cannot be 
generalised to 3D. 

When heating and cooling processes are included, and since the corre- 
sponding time-scales are faster than the dynamical time-scales, the gas can 
be described by a polytropic equation of state, as P oc with 7 being the ef- 
fective polytropic index. The isothermal value 7 = 1 is one of the possibilities, 
between 0 and 2 found in simulations. 

From the size-linewidth relation a oc and the second observed scal- 

ing law p oc it can be deduced that 

(7 oc (20) 



and therefore, if the turbulent pressure P is defined as usual by. 



dP 




(21) 



it follows that 



P oc log p 



( 22 ) 
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Fig. 17. {Left) 2D MHD self-gravitating simulations of a tnrbulent medium, with 
800 X 800 resolution, from Vazquez-Semadeni et al. (1997). {Right) Size-density 
relation found in these simulations of turbulence. The straight line has power —1, 
as observed in the interstellar medium. In the simulations, there is a large range of 
scales, for a given density. 



which is the logatropic equation of state, or “logatrope” . This behaviour has 
been tested in simulations (e.g., Vazquez-Semadeni et al. 1998), but the logat- 
rope has not been found adequate to represent dynamical processes occurring 
in the ISM (either hydro, or magnetic 2D simulations). The equation of state 
of the gas would be more similar to a poly trope of index 7 « 2 . 

Vazquez-Semadeni et al. (1997) have searched for Larson relations in the 
results of 2D self-gravitating hydro (and MHD) simulations of turbulent ISM: 
they do not find clear relations, but instead a large range of sizes at a given 
density, and a large range of column densities (see Fig. 17); the Larson relation 
is more the upper envelope of the region occupied by simulated points in 
the p — R diagram. They suggest that the observational results could be 
artifacts or selection effects (existence of a threshold in column density for 
UV-shielding for example). 

Note also that chemical reactions network, combined with turbulence, can 
be the source of considerable chaos (Rousseau et al. 1998). 

3.6 Self-gravity 

In turbulent simulations with self-gravity, most power is on small scales 
due to gravitational collapse (Ossenkopf et al. 2001) in contrast to non-self- 
gravitating hydrodynamic turbulence where large-scale structures dominate. 
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Gravothermal Catastrophe: Self-gravitating gas in an isothermal regime 
is known to be subject to a gravothermal instability. A sphere of gas confined 
in a box, and in contact with a thermostat, will tend to follow an isothermal 
sphere distribution, if the gas is hot enough. Below a certain critical temper- 
ature, there is no equilibrium any more, and the gas heats up when being 
cooled down, it is the gravothermal catastrophe, caused by negative specific 
heat (Lynden-Bell & Wood 1968). Small sub-condensations could form, and 
the physics will be more complex, since in the asymptotic case of an isolated 
clump, it will evaporate in a large number of dynamical times. The environ- 
ment is quite important, and the fact that the clumps can exchange mass 
and energy with surroundings as well (e.g. Padmanabhan 1990). 

Gravitational instability and cloud collapse is accompanied by fragmenta- 
tion in a system with very efficient cooling, and this process can provide the 
turbulent motions observed. The theory was first proposed by Hoyle (1953) 
who showed that the isothermal collapse of a cloud led to recursive fragmen- 
tation, since the Jeans length decreases faster than the cloud radius. Rees 
(1976) has determined the size of the smallest fragments, when they become 
opaque to their own radiation. They correspond roughly to the smallest scales 
observed in the ISM (sizes of 10 AU, and masses of 10“^ Mq, see the physical 
parameters of the “clumpuscules” in Pfenniger & Combes 1994). The time- 
scale for the whole hierarchy of clouds to recursively fragment is of the same 
order of the free-fall time-scale of the biggest structure Tg- If the size-ratio 
between two successive scales is as before: a = the time 

of collapse of the whole structures is: 



T 

Tff 



1 1 



with k = (5 fragments)^/"* ~ 1.5. 



k 

k-1 ’ 



(23) 



Simulations: Simulations of self-gravitating gas are very demanding, since 
large gradients rapidly set up, and spatial resolution must be adapted. A 
general rule is that the resolution is well below the Jeans length at any point, 
but the Jeans length shrinks along the collapse. Artificial fragmentation can 
sometimes happen due to artifacts (see e.g. Truelove et al. 1997). To compen- 
sate for the limited spatial range, periodic boundary conditions are used, to 
simulate the ISM. Klessen (1997) and Klessen et al. (1998) have considered 
the fragmentation of molecular clouds, in 3D, starting with an homogeneous 
cloud with small primordial fluctuations. These initial conditions are very 
similar to what is used in a cosmological background. The fluctuations are a 
Gaussian density field with power spectrum P{k) oc After one free-fall 
time, the gas has evolved into a system of filaments and knots, some of them 
contain collapsing cores. To avoid the problem of spatial resolution, the con- 
densed cores are then replaced by sink particles, simulating therefore a low 
cut-off scale, cf. Fig. 18. 
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Fig. 18. SPH simulation of a molecular cloud with initially 222 Jeans masses, and 
initial Gaussian density fluctuations with power law P{k) oc k~^. The collapse 
results in highly condensed cores, which grow by accretion from the surrounding 
gas. These cores {black dots) contain about 10% of all the gas mass at t = 1.6 free- 
fall times, then t — 2.0 and t = 2.8, 30% and 60% respectively (from Klessen et 
al. 1998). 



Klessen et al. (1998) compute the mass spectrum of clumps, which looks 
very similar to the observed one dN/dm oc ® in the ISM, at least over 
1.5 order of magnitude. The results however, depends still a lot on the initial 
conditions, and the power spectrum used (Semelin & Combes 1999). One of 
the problem of these simulations, in comparison with the ISM, is the ab- 
sence of energy re-injection, and systematic motions. In a Galaxy, differential 
rotation and shear should provide both. 

3.7 Molecular Fractal and IMF 

Since young stars form out of molecular clouds, it is interesting to search for 
the mass spectrum of new stars implied by the fractal structure of the clouds, 
and their own mass spectrum. 
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Several theories have been proposed to explain the IMF, at least 4 types 
of models: 

1. From the formation of a single star, variation of parameters to get sev- 
eral star masses (Larson 1973; Silk 1977; Zinnecker 1984); in particular, 
random variations with time and positions. 

2. Stars form in clusters, and protostars interact with each other through 
collisions and coalescence (Bastien 1981; Murray & Lin 1996). Observa- 
tions do show that most stars are born in clusters. 

3. Star formation in clusters, with competitive accretion (Larson 1978; 
Tohline 1980; Myers 2000). A nearly uniform reservoir of gas is assumed. 
But dense clumps are observed (e.g. Motte et al. 1998), with a small 
filling factor and they don’t favor accretion. 

4. Observations favor stars forming in pre- formed clumps. Therefore the 
IMF does not give any clue in star formation, but only in cloud forma- 
tion. Using random sampling of mass in a fractal hierarchy of molecular 
clouds gives the power-law IMF for stars, provided that a star’s mass 
is proportional to the mass of the clump in which it forms (Elmegreen 
1997). This power-law is steeper than the cloud mass spectrum because 
of competition for mass. 

The IMF has two characteristic masses, and 3 slopes (Scalo 1998). The 
lower-mass characteristic scale at 0.3— IMq is much larger than the smallest 
cloud fragment, and must have formed by accretion and collapse of smaller 
clumps. The smallest mass for the fragments is that of opacity- limited clumps, 
of the order of 10“^ Mq. Accretion must operate to reach the first stellar mass, 
and may be form brown dwarfs in between (Rees 1976; Yoshii & Saio 1985). 

The high-mass cut-off is due to a self-limiting accretion, due to stellar 
winds, and depends on ambient gas density and the mass of the cloud it is 
embedded. Stars grow until this self-limiting mass, a little larger than deu- 
terium burning limit (Larson 1982; Shu et al. 1987). There must be a certain 
scatter between initial cloud masses and the final star mass, due to a se- 
ries of unpredictable dynamical events, accretion, fragmentation, turbulence, 
self-gravity, . . . 

In the case of a pure hierarchy of clouds, containing equal mass per loga- 
rithmic interval, (see Sect. 3.3): 

n{m) dlogm (X m~^ dlogm . (24) 

The probability to have a random mass m is then: 

m~^ dlogm fx m~^ dm . (25) 

This is close to what is observed, n(m) oc (Heithausen et al. 1998). 

Stars also would have this kind of law, except that they compete for gas. 
Once a star is formed at a certain level of the hierarchy, the higher scale 
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has no longer any gas to form other stars. At a higher level, a star that 
normally would have contained much higher mass, because comprising a lot 
of subclumps, has now a mass lower than expected from the cloud hierarchy. 
This results in a steepening of the slope of the mass spectrum for the stars 
with respect to the clouds. Stellar clusters don’t compete for mass, since 
they have the total mass of their components, and indeed they obey also the 
relation n(m) dlogm oc m~^ dlogm. 

Another parameter also steepens the IMF, the time-scale of star for- 
mation, which is larger for high masses: tq oc oc . Star forma- 

tion is a dynamical process, and the fractal re-adjusts its structure during 
the process. The two factors for steepening can explain the Salpeter IMF 
n(m) dlogm oc dlogm, instead of that of the pure hierarchy law, as 

shown by random sampling simulations (Elmegreen 1997, 1999, see Fig. 19). 

If the simulations have no upper-mass limit, the IMF is prolonged to 
Afup = 7000 Mq. But the maximum observed is 120 — 150 Mq. This comes 
from an Eddington-limited accretion (Norberg & Maeder 2000), or oscilla- 
tions and instabilities in massive stars. There could also be a problem of tim- 




Fig. 19. Simulations of random sampling of mass in a fractal hierarchy of molecular 
clouds: (Top left) IMF slopes versus logarithm of mass, in 100 models that differ 
in the sequence of random numbers used to generate and sample the fractal cloud 
tree. Each IMF slope is fit using 200 stars. The Salpeter value of —1.35 is shown 
by a dashed line. The three values indicated by crosses have their complete IMFs 
shown in the other panels. The fitted portions of these IMFs are indicated by the 
offsets, with two different bin sizes shown (from Elmegreen 1999). 
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ing. After the smaller stars have formed, then a large fraction of the cloud is 
used up, and this leads to cloud destruction, well before a whole GMC can 
form a massive star. 



Fractal Structure in Galactic Star Fields: Young stars just formed out 
of the fractal molecular clouds should reflect also a fractal structure, before 
diffusion and scattering occurs. Indeed archive HST images of 10 galaxies, 
Gaussian smoothed at various scales from lOkpc to kpc along the spiral arms, 
show a hierarchical structure (Elmegreen & Elmegreen 2001). The densest 
structures are like the Pleiades, at the bottom of the hierarchy. The observa- 
tions of NGG 2207 displayed in Fig. 20, show the relation on their sizes L in 
projection: 

n{L) d\og{L) oc L~^ d\og{L) , (26) 

with D = 1.12. Models with a fractal structure in 3D, depending essentially 
on the center positions and not on the shape of the clouds, reproduce these 
observations. Note also that the number of HII regions in external galaxies 
obey n{L) oc dL (Kennicutt et al. 1989; Oey & Glarke 1998). 

Stars form in the densest part of the fractal structure of the clouds. They 
keep the global fractal structure at least for a few dynamical times of the 
largest structures. The dynamical time is short at small scales (in r^/^), 
where the stars have time to disperse, but at large scale the fractal survives. 

3.8 Conclusions 

Molecular clouds have a fractal structure, over 9 orders in mass and 6 orders 
in scale. The fractal dimension is D = 1.7 — 2. It has a filamentary geometry, 
and is highly hierarchical, almost a pure hierarchy according to the mass 
spectrum. 

Turbulence and self-gravity are the key physical phenomena at the origin 
of the fractal structure. It is in this fractal structure of clouds that stars are 
formed, and the IMF may result from this structure. 



4 Stability and Formation of the Fractal 

4.1 Stability of the Molecular Disk 

The conditions of stability of a dissipation-less component like the stellar one, 
is relatively well known. High enough velocity dispersions are required to sup- 
press axisymmetric instabilities and even spirals, bars or ^-instabilities. The 
latter provide heating to the medium which becomes un-responsive. The gas 
is dissipative and has a completely different behaviour; it is always unstable, 
and its velocity dispersion is fixed by regulation and feedback. 

The local stability criterion has been established by Toomre (1964): sta- 
bilisation is obtained through pressure gradients (velocity dispersion c) at 




138 



F. Combes 



1 . 2 




Fig. 20. Six levels of Gaussian smoothing of a star-forming patch in the galaxy 
NGC 2207. The number of pixels in the Gaussian smoothing function is shown in 
each panel (from Elmegreen & Elmegreen 2001). 



small scale, smaller than the Jeans length: Aj = cts = cj[2'KGpY^'^ ■ In pres- 
ence of a self-gravitating disk, in the isothermal approximation, with surface 
density A7 = hp. the scale-height is given hy h = /(2nGS). Then the Jeans 

length is Aj = c'^I{2ttGE) = h. 

At large scale, the rotation stabilises through centrifugal forces, or the 
tidal shear prevents structure larger than Tcrit to form. The tidal forces on a 
structure of size AR are 






’^AR 



(27) 



dR 
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(where k is the epicyclic frequency). The internal gravity forces of the con- 
densation AR are: 



GSttAR^ 

AR^ 



(28) 



Equating this to F^id yields the critical size, above which structures will be 
teared off: 

Gy 

Lent ^ ^ . (29) 

The Safronov-Toomre criterion is obtained in equaling Aj and Lcrit^ 



Q 



KC 

ttGS 



> 1 . 



(30) 



For one component, a radial mode (w, k) in a linear analysis obeys the 
dispersion relation (where k is the wave number): + k^R — 2'KGEk, 

which means that self-gravity reduces the local frequency n. 

The tidal field of the galaxy is not always disruptive, it can also be com- 
pressive in the center, when there is a flat density core. If the mean density 
(p) of the spherical distribution is 



then: 





(31) 




(32) 


GM{R)/R^ gives: 




Ftid = 47tG 0 (d) - p] . 


(33) 



thus if the density is flat inside a certain radius (the core), the gas will 
be compressed, instead of being teared off. Note that the tides are always 
disruptive, in case of a power law density profile in r~'^ with 7 > 1 (Das & 
Jog 1999). In that case, 

Ftid = 4^G -l^pAR. (34) 

Can this play a role in the formation of dense nuclear gaseous disks in 
starburst galaxies? In ultra- luminous galaxies (ULIRGs), the tidal field may 
become compressive inside 200 pc (see the case of Arp 193 in Fig. 21). 

In the case of compressive tidal field, molecular clouds will not be teared 
off inside the core. However, the density to have a gravitationally bound 
cloud, leading to star formation, will be high. Indeed, an additional in- 
ward/compressive field increases the velocity dispersion required to satisfy 
the virial equilibrium, and hence the critical density required for stability. 
This minimum density is plotted in Fig. 21 for the example of NGC4472, 
inside the core. Outside, since the tidal field is disruptive, the density plotted 
is the density below which a cloud will be disrupted by the tides. 
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Fig. 21. {Left) Computed volume density for H 2 gas to exist in bound, virialized 
clouds versus radius for NGC4472. The dashed line separates the outer region of 
disruptive tidal field from the inner region of compressive tidal field. {Right) Tidal 
field per unit mass versus radius for the ultraluminous galaxy Arp 193. The field 
becomes compressive inside 220 pc (from Das & Jog 1999). 



Stability with Several Components: If a stability criterion is easy to 
derive for one component, the same is not true for a two-components fluid, 
since the coupling between the two makes the ensemble more unstable than 
each one alone. The one-component criterion can be applied separately to the 
stellar and gas components, where the corresponding values of S and c are 
used, leading to Qs and Qg. Because Cg <C Cg however, only a small percentage 
of mass in gas can destabilize the whole disk, even when Qg > 1 and Qg > 1. 
For two fluids with gravitational coupling, the dispersion relation yields a 
criterion of neutral stability = 0: 

27rG/crg 2TTGkSf; 

2 I 7,2 2 2 I 1,2 2 “ ^ 

which gives directly an idea of the relative weight of gas and stars in the insta- 
bilities (cf. Jog 1996). At low k (long waves), essentially the stars contribute 
to the instability, while at high k (short waves), the gas dominates. 



Stability and Vertical Structure: Stability criteria in general do not 
consider the thickness of galactic planes, which can be important factors of 
stabilisation. These can be represented by reduction factors for stars and gas 
/g = (U-fc/ig)“^ and /g = {l + khg)~^ of scale-heights hg and hg respectively 
(cf Romeo 1992). Note that the stabilising effects depend on wavelengths 
2tt / k. The vertical and planar heating mechanisms, and related self-regulation 
processes, are almost decoupled, as well as stability properties (see Fig. 22). 
This stabilisation is particularly important in the outer parts of galaxy disks, 
which are flaring. 

Disks Are Marginally Stable: One striking observation involving gaseous 
galaxy disks is that they have structures at all scales, either small-scale 
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Fig. 22. Two-fluid stability curves, plotting the Q parameter as a function of re- 
duced wavelength A; {Left) Case of a = 0.1, a being the surface density ratio 
between the cold (C) and hot (H) component. The various curves correspond to 
increasing Sh, which is the «- velocity dispersion of the stars. Sh — 0 corresponds 
to the infinitely thin disk. The various curves precise how much stabilisation brings 
the thickening of the stellar disk. (Right) Same for a = 0.137, corresponding to a 
larger gas fraction in a galactic disk, for instance. The low-A instabilities are due 
to the gas (from Romeo 1992). 



dumpiness, or spiral structure, etc. They appear therefore unstable. These 
instabilities could even be at the origin of the heating, since the large-scale 
velocity dispersion of the gas is remarkably constant with radius until the 
outer parts, only slightly higher in the center when there is active star for- 
mation there. It is therefore attractive to assume that disks are in some sense 
self-regulated by gravitational instabilities, and are at any time at the limit 
of stability, the gas dissipation being there to cool the disk down while insta- 
bilities are heating it. 

This hypothesis is supported by the observation of the Q parameter over 
galactic disks, that does not show large variations, but on the contrary is 
almost constant over the disks. The stellar dispersion has been studied in 
our own galaxy and in a few external galaxies by Lewis & Freeman (1989) 
and Bottema (1993). The velocity dispersion decreases exponentially with 
radius, in parallel to the stellar surface density, and Qs is nearly constant as 
a function of radius, at least for large galaxies, and vary between 2 and 3 
from galaxy to galaxy. 

How is established the stellar equilibrium? In a first approximation, the 
stellar disk can be considered as a self-gravitating infinite slab, locally isother- 
mal (az independent of z), and its density obeys the sech^{zjzQ) law in the 
2 -direction, perpendicular to the plane. The characteristic scale-height zq is a 
function of the vertical velocity dispersion az, and the stellar surface density 

T's(r) = zopo'. 
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Zo 



2TTGSs{r) ■ 



(36) 



A well-known observational result is that the stellar scale-height is a constant 
with radius in many edge-on external galaxies (van der Kruit & Searle 1981, 
1982) or only slightly increasing (de Grijs et al. 1997). Since the radial surface 
density of stars is in general exponential in exp”’’/^’", with a radial length 
of hr this means from the previous relation that the z-velocity dispersion is 
also exponential with radius, but with a radial length of 2hr- This is indeed 
compatible with the data, but with large uncertainties (Bottema 1993). In- 
deed, the z-velocity dispersion is not known for the edge-on galaxies, but the 
face-on ones, and the comparison requires to know the relation between z and 
r projection of the dispersion. In the solar neighbourhood, = 0.6(7^, and 
this ratio is arbitrarily assumed to be valid in external galaxies too. Other 
assumptions is a constant mass to light ratio for the whole stellar disk, and no 
or little dark matter within the optical disk. These are first approximations, 
and it could be that in some galaxies the stellar equilibrium is reached with 
a scale-height slightly varying with radius (e.g. Narayan & Jog 2002). 

The scale-height of the gas component is varying much more with radius, 
the gaseous plane is flaring first linearly, and even exponentially at large radii 
(Wouterloot et al. 1990). However, the vertical velocity dispersion is constant 
with radius in the outer parts of galaxies, where the rotation curve is flat 
(Dickey et al. 1990; Combes & Becquaert 1997): Cg « 6km/s. The behaviour 
of «: oc 1/r (for a flat rotation curve) is somewhat similar to the gas surface 
density Ag oc 1/r (e.g. Bosma 1981), and therefore Qg oc Cg^/Ag is constant 
with radius in the outer parts, again supporting a regulation mechanism, in 
terms of gravitational instabilities (Lin & Pringle 1987). 



4.2 Star Formation and Critical Surface Density 

The Toomre criterion applied to gas has often been used as a criterion for 
star formation (Kennicutt 1989). The criterion Qg < 1 for gas to be unstable 
can be inverted in the condition Ag > Acritj i-e., a critical surface density for 
star formation. This criterion is global, not local. 

So far, only global quantities have been correlated with success, when the 
gas surface density is averaged out over the whole galaxy, and the same for 
the star formation rate. The star formation tracer can vary, from the Ha flux 
for normal galaxies, to the Far Infrared luminosity L(FIR) for starbursts, 
which are highly obscured (Kennicutt 1998). 

While the starbursts explore a wider range and dynamics of parameters, 
the relation between the global gas surface density and star formation rate 
(SFR) is the same for extreme and normal galaxies: it is possible to derive a 
“global” Schmidt law, with a power n = 1.4 (Kennicutt 1998) 



Asfr oc Ag/^ , 



(37) 
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Fig. 23. Relations between Star Formation Rate (SFR) and gas surface densities, 
from Kennicutt (1998). (Left) A global “Schmidt” law, with a slope of n = 1.4 as the 
best fit. (Right) Linear relation between SFR and gas over dynamical time (rdyn) 
ratio. Normal galaxies are the triangles, while starburst galaxies are the squares. 



(cf. Fig. 23). Another formulation works as well 

F'sfR OC Agast? oc , (38) 

^dyn 

where f2 is the angular frequency in the galaxy, which is inversely proportional 
to the dynamical time-scale tdyn- 

Several justifications can be found a posteriori: if the star formation is 
locally due to the gravitational instability of the gas, this occurs on a free- 
fall time-scale, and the star-formation rate is: 

^P* Ps^s 1,5 

= PSFR oc — oc pg,,, , (39) 

very close to the power n = 1.4; but the correlation is not observed locally. 

Globally, this applies also, if the star formation is due to the global gravita- 
tional instability of the gas disk, that occurs in a dynamical time-scale: 

^ = Asfr oc , (40) 

(XZ ^dyn 

which accounts for the second formulation. Alternatively, star formation could 
be triggered in marginally stable clouds, by the crossing of spiral arms, and 
the frequency of arm crossing is proportional to fl — flp (Wyse & Silk 1989), 
or roughly to 17 far from corotation (where the clouds never cross the arms) . 
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This second formulation might also explain the Tully-Fisher relation (Silk 
1997; Tan 2000), since if 27sfr oc Sgas^, then 

Lb CX X'sfR-R OC T’gas'CcircT? (41) 

with OC ER from the virial, and provided that 27gas oc if* is verified 
over the main spiral classes (Roberts & Haynes 1994), it can be deduced that 



Problems with the Use of Q^: As described before, gravitational in- 
stabilities are very efficient on a dynamical time-scale, to self-regulate the 
equilibrium of disks, and therefore the values of Q are almost constant over 
galactic disks. Only in very violent off-equilibrium cases, such as galaxy inter- 
actions, mergers, leading to starbursts, is the equilibrium transiently broken 
(e.g. Combes 2001). 

Another difficulty is that if gravitational instabilities are necessary for 
star formation, they are not sufficient. There must be other essential param- 
eters controlling the onset of star formation, since in outer parts of spiral 
galaxies, and in the disks of dwarf irregulars, the gas medium has formed 
self-gravitating structures, but without star formation. 

There must be for instance a time-scale condition to have a starburst: gas 
must be gathered in a very short time-scale, smaller than ~ 10^ yr, shorter 
than the onset of feedback from the first OB stars formed, through supernovae 
explosions and stellar winds, before the starburst can blow the gas out. Also 
the volumic gas density might be considered. With the same surface density, 
a flaring gas plane should not have the same efficiency to form stars. 

4.3 Small-Scale Stability 

The stability of the smallest observed molecular clumps has always been 
a puzzle. Their free fall time is much less than 1 Myr, and the rate of star 
formation should be much larger than observed. Pressure support is necessary, 
since magnetic fields cannot halt the collapse. 

For an isothermal gas, that radiate efficiently, fragmentation cannot be 
stopped until the fragments are so dense that they become optically thick, and 
shift in the adiabatic regime. Without external perturbations, the smallest 
fragment when this occurs is about 10“^ Mq. The isothermal regime breaks 
down when the free-fall time tg becomes shorter than the Kelvin-Helmholtz 
time-scale Ikh, or time-scale to radiate the thermal energy by radiation: 

tg^tKH = i/2NkT/L with L = 4:nfR'^aT‘^ , (42) 

where / is a factor of the order of one, depending on departure from black- 
body radiation, cr is the Stefan-Boltzmann constant. This leads to a fragment 
mass (/i being the mean molecular weight) of: 



(43) 
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(e.g. Rees 1976). The average characteristics of these smallest fragments 
are thus 10“^ Mq in mass, 10^°cm“^ in density, 20 AU in radius, A^(H 2 ^ 
10^® cm“^ in column density. For these clumps, the free-fall time is ts ~ 
1000 yr, but they are optically thick, in the adiabatic regime, and the pres- 
sure support ensures that their life-time is much longer. If these clumps are 
part of a fractal, clump-clump collisions lead to coalescence, heating, and to 
a statistical equilibrium with fragmentation (Pfenniger & Combes 94). 

On the observational side, all situations are encountered: dense cores with 
isolated star formation, dense cores with clustered star-formation, dense cores 
without any star formation. The triggering of star formation could be due 
to un-balanced time-scales. An external perturbation brings a non-linear in- 
crease of velocity dispersion, due for instance to galaxy encounters. These 
trigger collisions, then either coalescence, or shredding and increase of ve- 
locity dispersion. If there is a time-delay between the formation of massive 
clouds leading to star formation, and the star formation feedback, then a 
starburst may be triggered. 

Modelisation have been studied in the literature, trying to explore the 
numerous parameters: cooling of the gas, fresh supply of gas, etc.. . . Limit 
cycles appear, and often a chaotic behaviour (e.g. Hirashita & Kamaya 2000). 

4.4 Gas in the Outer Parts 

Observationnally, the gas in the outer parts of galaxy disks is stable with 
respect to star formation, although not to gravitational perturbations. Ex- 
amples are to be found in HI-21cm maps, with clumpy structure and large- 
scale spiral structure, like in the giant spiral MlOl, or the dwarf irregular 
NGC2915, and similar conditions are encountered in Low Surface Bright- 
ness Galaxies (LSB). These environments are evidence that star formation 
and gravitational instability have not exactly the same criterion. Several ar- 
guments could be advanced to explain the non-star formation: either a low 
volumic gas density, due to a thick disk, or a flaring in the outer parts, or 
the lack of external perturbations and companions for LSB. 

A quite prototypical case is the dwarf galaxy NGG2915 (Bureau et al. 
1999). It is a blue compact dwarf galaxy, with a very extended HI disk, up 
to 22 optical scalelengths (see Fig. 24). The HI disk shows a central bar and 
extended spiral arms, that appear a coherent density wave. Both bar and 
spiral extend farther than the optical component. The velocity field reveals 
characteristic streaming motions. As in all dwarfs of these types, the rotation 
curve shows that the galaxy is dominated by dark matter all over the disk. 
The mass of the stars is smaller than the mass of gas, itself dominated by 
dark matter. 

How can the bar plus spiral pattern be explained? If the dark matter is 
assumed as a quasi spherical halo, dominating the potential, than the disk 
is not self-gravitating enough to be instable to such a perturbation. The 
Toomre Q value varies between 8 and 20 over the disk. A solution would be 
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Fig. 24. (Left) HI-21cm map of the dwarf galaxy NGC 2915 (contours), superposed 
on an optical photograph from Digitized Sky Survey; (Right) Grey scale image of 
the HI map, showing better the bar and spiral structure (from Bureau et al. 1999). 



to assume that a large fraction of the dark matter is baryonic, and follows 
the disk distribution as cold gas (Pfenniger et al. 1994, Pfenniger & Combes 
1994). Multiplying the HI surface density by a factor around 10 is in general 
sufficient to explain the rotation curves (e.g. Hoekstra et al. 2001). This 
solution indeed accounts for the formation of the bar plus spiral pattern, and 
has also the swing parameter X = X/Xcrit (with Xcrit = quite 

optimum (Bureau et al. 1999). But the main difficulty for this solution is to 
explain the non-star formation. 

Another possibility to trigger bar formation is to assume that the non- 
baryonic dark matter halo is triaxial, and slowly tumbling. CDM numerical 
simulations have shown that halos are sometimes triaxial, although frequently 
oblate and axisymmetric in the baryonic plane (Katz & Gunn 1991; Dubinski 
1994). In any case, they are rotating very slowly, about an order of magnitude 
less than normal baryonic bars in galaxies, and the HI bar in NGC 2915 should 
also have a very slow pattern speed (Bureau et al. 1999). Such a model has 
been simulated recently by Bekki & Freeman (2002) who also require the 
inclination of the disk with respect to the plane including the triaxial halo’s 
long and middle axes. Simulations have also been done of a baryonic disk dark 
matter, a model than can reproduce the observed pattern quite satisfactorily 
(Masset & Bureau, 2002). 

4.5 Formation of the Structures 

How to form and stabilize the hierarchical structure of the H 2 gas? Self- 
gravitating simulations have already shown that recursive fragmentation oc- 
curs, and forms transiently a self-similar structure, however all the mass is 
then collapsing into dense cores (Klessen et al. 1998). 
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Fig. 25. (Left) Snapshot of a 2-D simulation of formation of structures in the 
galactic shear. Self-gravity and dissipation of the gas are taken into acconnt. The 
galactic center is toward the bottom of the box. {Right) Fractal dimension computed 
at the same epoch, as a function of scale (from Semelin & Combes 2000). 



N-body simulations were carried out (Semelin & Combes 2000), with simi- 
lar initial conditions, periodic boundary conditions, and a perturbation spec- 
trum P{k) oc but unlike previous simulations, there is a schematical 

process to shift to adiabatic regime at low scale. The code used was the tree- 
code, with sticky particles included. The scheme to stop the dissipation and 
fragmentation at the smallest scale was varied, either by elastic collisions, 
or even super-elastic collisions, to re-inject energy at small-scales. A frac- 
tal structure is indeed observed, but is not stationary. Energy re-injection is 
required also at large scale, to renew the structure. 

One way to re-inject energy at large scale is the differential rotation of the 
Galaxy. Computations of self-gravitating shearing sheets have already given 
interesting large-scale structures, that might be similar to fractals (Toomre 
1990). 

When galactic shear (and Coriolis forces) are taken into account in molec- 
ular cloud formation simulations, the structures are formed over several orders 
of magnitude. The characteristic spiral filaments are observed, and embed- 
ded clumps form, and can disappear and reform stochastically. Constantly 
the shear destroys the small clumps, that reformed again. Simulations of a 
shearing sheet are shown in Fig 25. The fractal dimension varies with scale 
and with the epoch in the simulation. It is lower at small scale (the clumps) 
and larger (around 1.8) just below the sheared filaments scale. 

The work was generalised by Huber & Pfenniger (2001), who made the 
computations in 3D, taking into account the galaxy plane thickness of the 
gaseous component. Their cooling is simulated by a viscous force, propor- 
tional to the particle velocity. They also find the characteristic filaments due 
to the shear combined with self-gravity, and measure the corresponding frac- 
tal dimension of the structures. Since the filaments are much longer in the 





148 



F. Combes 



4.3 




Fig. 26. (Left) Evolution of a shearing box, at 4.3 and 10 rotations around the 
galactic center, with a “medium” dissipation for the gas; {Right) Fractal dimension 
as a function of scale, for different values of the dissipation; the fractal dimension 
is smaller for strong dissipation (from Huber & Pfenniger 2001). 





plane than thick (perpendicular to it), the fractal dimension is in fact dom- 
inated by the z-dimension plane morphology, at least at large scales. The 
measured fractal dimension is highly dependent on the amount of dissipation 
(see Fig. 26), since stronger dissipation enhances the formation of clumps in 
the sheared filaments. Although the dynamical range of simulations until now 
is too small to probe entirely a true fractal distribution, these calculations are 
encouraging that a fractal structure could develop at sub-kpc scales, when 
shearing is dominant, near the high cut-off of the ISM fractal. 

Wada & Norman (1999) have also simulated a shearing galaxy disk, but 
with a multi-phase medium: they obtain the formation of dense clumps and 
filaments, surrounded by a hot diffuse medium. The quasi-stationary filamen- 
tary structure of the cold gas is another manifestation of the combination of 
shear and self-gravity in a cooled gas. 

4.6 Conclusions 

Gaseous disks, and in particular the H 2 component, are not in dynamical 
equilibrium or marginally, and are self-regulated by gravitational instabilities. 
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They are unstable at all scales and subject to spiral structure, filaments, 
clumpy hierarchical structure. 

To explain the fractal structure of the cloud ensemble, self-gravity is re- 
quired, together with injection of energy at large-scale, and may be at small 
scale. The galactic rotation is the main source of energy, and it takes Gyr for 
the gas in a galactic disk to flow slowly to the center. These radial flows can 
be faster in the case of external perturbations, such as galaxy interactions. 

The presence of gravitational instabilities, for cloud and structure for- 
mation, is necessary but not sufficient for star formation. Other processes 
must be considered to trigger star formation, which would explain the large 
extensions of unstable gas without stars in the outer parts of galaxies. 

5 H 2 in External Galaxies 

5.1 H 2 Content and Morphological Types 

There are several published surveys of CO emission in galaxies, in particular 
more than 300 galaxies have been observed in the FCRAO survey (Young 

6 Knezek 1989), where the average molecular content is found comparable 
to the atomic one M(H 2 /M(HI~ 1 (Young & Scoville 1991). Part of the 
galaxies may have a bias, in particular some are selected through the 100/im 
IRAS flux, which is in general well correlated with CO emission. Another 
survey by Casoli et al. (1998) near the Coma cluster has shown on average 
M(H2)/M(HI) ~ 0.2. 

The relative molecular content varies strongly with morphological type: 
while the HI content is more abundant in late-type galaxies, the H 2 content 
has the opposite trend. The apparent H 2 /HI mass ratio decreases monotonously 
from SO to Sm galaxies by a factor near 10 (Young & Knezek 1989; Casoli et 
al. 1998), as shown in Fig. 27. This gradient towards the late types might be 
only due to a reduced CO emission, because of metallicity effects, and not to 
an intrinsic reduction of the H 2 content. When only the more massive objects 
are taken into account, this tendency with type disappear: there is no gradi- 
ent of molecular fraction. This supports the hypothesis that the gradient is 
due to metallicity, which is correlated with total mass. 

The relation linking mass M and metallicity Z is observed of the form 
Z oc (e.g. Zaritsky 1993; Tamuraet al. 2001), in agreement with models 
(e.g. Yoshii & Arimoto 1987). Such a relation is due to the fact that the star 
formation rate is non-linear with gas density, and deeper potential wells retain 
more efficiently their stellar ejecta. 

The strong dependency of the CO-to-H 2 conversion factor X on metal- 
licity has been confirmed now in many galaxies. In the Small Magellanic 
Cloud, X could be 10 times higher than the standard value of 2.3 • 10^° 
molcm“^ (Kkm/s)“^ (Rubio et al. 1993). The effect has been seen also in 
Local Group galaxies, such as M31, M33, NGC 6822 and IC 10 (Wilson 1995). 
The physical explanation is complex, since the CO lines are optically thick. 
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□ <log H2/HI> (SA) 




Morphological type 

Fig. 27. H 2 /HI mass ratio in galaxies as a function of morphological type; full 
squares: mean values with upper limits treated as detections; empty squares: mean 
values with upper limits taken into account using the survival analysis (SA), from 
Casoli et al. (1998). 



but the size of the clouds (and therefore the filling factor) decreases with 
metallicity, both due to direct CO abundance, and UV-phtotodissociation 
increased by the depletion of dust. When the dust is depleted by a factor 20, 
there is only 10% less H 2 but 95% less CO (Maloney & Black 1988). 

CO in Dwarf Galaxies: In dwarf irregular galaxies, rich in HI gas, CO 
emission is very low, and it is difficult to know the H 2 content. If the HI/H 2 
ratio is assumed constant from galaxy to galaxy, then X varies with 
(Arnault et al. 1988). Dwarf Ellipticals are easier to detect than Dwarf Irreg- 
ulars (Sage et al. 1992). Observations by Gondhalekar et al. (1998), Taylor et 
al. (1998), and Barone et al. (2000) confirm this strong dependency, and con- 
clude that the conversion factor must be varying nonlinearly with metallicity, 
increasing sharply below ~ 1/10 of the solar metallicity [12-|-log(0/H) < 7.9] 
(cf. Fig. 28). 



Low Surface Brightness Galaxies: Low Surface Brightness (LSB) galax- 
ies have in general large characteristic radii, large gas fraction (up to /g = 
95% in LSB dwarfs, Shombert et al. 2001), and are dark matter dominated. 
They are quite un-evolved objects. Their total gas content is similar to that of 
normal (HSB) galaxies (McCaugh & de Blok 1997). They have also low sur- 
face densities of HI, although with large sizes, and in their physical properties 
they resemble the outer parts of normal HSB galaxies. 
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Fig. 28 . Dependence of the CO integrated intensity Ico (normally proportional 
to the average H 2 surface density N{¥L 2 ) on the oxygen abundance, or metallicity 
(from Taylor et al. 1998). 



Their metallicity is low, according to the correlation of metallicity with 
surface brightness (Vila-Costas & Edmunds 1992). It is therefore no surprise 
that CO emission is difficult to detect in LSB (de Blok & van der Hulst 
1998). Only in edge-on LSB galaxies CO has been detected (Matthews & 
Gao 2001), and the derived M(H 2 )/M(HI) is between 1 — 5 • 10“^ (with 
standard conversion ratio). Due to their low surface density, these galaxies 
have a very low efficiency of star formation (van Zee et al. 1997). A gas rich 
galaxy is stable only at very low Sg (cf. Malin 1, Impey & Bothun 1989). 
The cause of their low evolution rate could be the absence of companions, 
since LSB live in poor environments (Zaritsky & Lorrimer 1993). It is well 
known that galaxy interactions, by driving in a high amount of gas, trigger 
star formation. 

Since their discovery, the recognition that a large number of LSB exist 
change the luminosity function of galaxies; LSB could be a significant reser- 
voir of baryons in the local universe (Bothun et al. 1997). Their low evolution 
time-scale is compatible with models (Boissier & Prantzos 2000). 

It is important to note that LSB obey the same Tully-Fisher relation 
as normal galaxies; for the same rotational velocity V, these galaxies are 
twice as large in R, and according to the Virial, M (x V^R and the mass- 
to- light ratio M/L increases as surface density decreases. In general, the 
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Tully-Fisher relation for gas-rich galaxies works much better in adding the 
gas mass to the total mass of the galaxies, in some sense including the gas 
mass in the “luminosity” . It appears that the TF-relation is a link between the 
baryonic mass and the rotational velocity (it is the “baryonic TF” , McGaugh 
et al. 2000). 

5.2 Molecular Disks Structure 

Radial Distribution in Spirals: The atomic and molecular gas (traced by 
CO) have very different radial distributions (cf. Fig. 29). The CO emission, 
like all components related to star formation, follows an exponential distri- 
bution, while the HI gas alone is extending much beyond the “optical” disk, 
in average i?Hi = 2i?opt- The HI is often depleted in the center of galaxies, 
and sometimes the depletion is compensated by H 2 , but it is not always the 
case (e.g. M31). The larger sample of radial distributions reported in the 
BIMA-SONG survey (Regan et al. 2001) confirm these conclusions. 




Fig. 29. Radial distributions of various surface densities in a typical spiral galaxy 
NGC6946: H 2 (CO) and HI column densities. Blue, Radio-continuum and Ha sur- 
face densities (adapted from Tacconi & Young, 1986). 
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Fig. 30. Integrated CO intensity /(CO) contonrs, superposed on the Digital Sky 
Survey (DSS) red image of M31. The black dot in the bottom left corner represents 
the telescope 23” beam (from Neininger et al. 1998). 



Spiral Structure: The H 2 component participates even better than the HI 
and stellar component to the density waves. Due to its low velocity dispersion, 
it is more easily trapped in the spiral potential wells, and the structure is more 
contrasted. Large streaming motions are detected at the crossing of the spiral 
density wave. A good illustration is plotted in Fig. 30, with the CO(l — 0) 
map of M31, with 23” (90 pc) resolution, by Neininger et al. (1998). The on- 
the-fly mapping procedure allows to reach a large dynamical range in spatial 
scales. The CO emission is particularly well correlated to the dust lanes and 
more generally to HI in the arms. 

The particular large CO emission contrast may be explained also by other 
factors, such as: 
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• different CO excitation in arms (larger density), 

• heating due to the enhanced star formation, 

• enhanced CMC formation in arms, 

• enhanced H 2 or CO formation (chemical time-scale of 10® yr), 

• HI is formed out of photo-dissociation of H 2 , just at the edge of the arms, 

• more self-shielding of CO in the arms. 

The CO emission appears to follow the dust lanes, in the concave side of 
spiral arms (cf. M51, Tilanus & Allen 1991), and is often gathered in clouds 
aligned as pearls on the spiral string. 

5.3 H 2 in Barred Galaxies 

CO emission is particularly useful to map the bar and rings in spiral galaxies, 
since HI is in general depleted in central regions, and Ha is often obscured. 

Barred galaxies have more CO emission, and the derived H 2 gas is more 
concentrated to the circum-nuclear regions than in unbarred galaxies (Saka- 
moto et al. 1999). This confirms dynamical theories of transport of the gas by 
bars (e.g. Combes 2001). More than half of the gas in the central kpc comes 
from the large-scale galactic disk and is too high in surface density to have 
been consumed through star formation, the present rate being 0.5 — lMQyr“^. 
There is no correlation, however, with any AGN activity (Seyfert, Liner, 
etc. . . ) , although bars have been invoked as a mechanism to fuel the central 
black holes (e.g. Mulchaey & Regan 1997). 

CO maps of barred galaxies reveal all kinds of morphologies, rings, gas 
bars, spiral structure, characteristic “twin peaks” , where the gas accumulates 
at the crossing of parallel and perpendicular orbits, at the start of the leading 
offset dust-lanes. The CO gas traces particularly well the resonant rings (e.g. 
Buta & Combes 1996). That the CO traces the offset dust-lanes is conspicuous 
in the well studied barred galaxy NGC 1530 (Reynaud & Downes 1997). 

In a small sample of barred galaxies, Sheth et al. (2002) have observed 
a spatial offset of 300 pc in average between Ha arms and CO arms; this is 
interpreted as a time-delay between gas accumulation and star formation, 
and depends on the gas flow. The gas flows can inhibit star formation, when 
they are too fast (Reynaud & Downes, 1999). The star formation is favored 
in ring accumulations, or nuclear bars. 

Two embedded bars, with different pattern speeds are sometimes required 
by observations of morphology and dynamics, as in the example of M 100 
(Garcia-Burillo et al. 1998a, Fig. 31). Sometimes, the gas in the center is 
also observed at high altitude above the plane (cf. NGC 891, 5907, or the 
peanut-shaped galaxy NGC 4013, Garcia-Burillo et al. 1999). These might 
be accounted for by processes associated to star-formation, more than purely 
dynamical mechanisms. The only exception could be gas in retrograde orbits. 

The CO gas has been detected also in counter-rotating galaxies, as in the 
early-type spiral NGC 3626 (Garcia-Burillo et al. 1998b), where the H 2 is 
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Fig. 31. {Left) Integrated CO emission contours obtained towards MlOO 
(NGC4321) with the IRAM interferometer, overlaid on an Ha grey scale image. 
{Right) Particle orbits of molecular clouds in a simulation with only one pattern 
speed {top, 17 = 23 km/s/kpc), and with two different pattern speeds for the two em- 
bedded bars {bottom, 17 = 23 and 160 km/s/kpc) from Garcia-Burillo et al. (1998a). 
Only in the second case, a nuclear spiral structure is obtained, similar to the one 
observed in CO {left). 



concentrated in a compact nuclear disk of ^ 1 kpc radius, rotating in a sense 
opposite to that of the stars, and in the same sense as the HII and HI gas. The 
counter-rotation gives rise to 2-streams instability, and the formation of m = 
1 leading spiral structure (example NGC3593, Garcia-Burillo et al. 2000). 

Molecular gas can be expelled out of the galactic plane in rare circum- 
stances, in violently interacting systems: e.g. NGG 4438 in the Virgo cluster 
(Gombes et al. 1988), NGG4631/56 (Neininger 2000). 

5.4 Molecular Gas in Polar Ring Galaxies 

Polar Ring Galaxies (PRG) are due either to accreted gas from a companion 
or are formed in a merger of two galaxies with orthogonal disk orientations 
(e.g. Bekki 1998). PRG are used to determine the flattening of the dark 
matter, since the kinematics of the polar material probe the 3D potential. 

The polar ring is due to gas resettling in the polar plane, but stars are 
dispersed in a wide-scale halo. The ages of the stars in the polar ring thus date 
the accretion/merger event. Frequently large amount of HI gas are detected 
in the polar planes, rotating around the central early-type galaxies. This gas 
is likely to come from outer parts of the interacting galaxies, and therefore 
its metallicity should be low, unless it has been enriched by the subsequent 
star formation. 

GO emission has been detected in polar rings (Taniguchi et al. 1990, 
Gombes et al. 1992, Watson et al. 1994). This could bring some insight in the 
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formation event: metallicity coming from the recent star formation, or the 
merged perpendicular disk forming the polar ring has not been destroyed. 

5.5 Molecular Gas in Elliptical Galaxies 

Most elliptical galaxies possess accreted gas, already detected in HI (Knapp 
et al. 1979, van Gorkom et al. 1997). This gas is either the remnant of the 
merger event between two spirals that have given birth to the elliptical, or 
comes from subsequent accretion of small gas-rich companions. The emission 
of the cold dust has been detected at 100 ^m by IRAS (Knapp et al. 1989), 
and CO molecules are also frequently detected (Lees et al. 1991, Wiklind et 
al. 1995). 

The relative H 2 content is low (see Fig. 27), the M(H 2 )/M(HI) ratio is 
2 — 5 times lower than in spiral galaxies. The CO gas is more present in field 
ellipticals than in clusters. The excitation temperature, as measured by the 
ratio between CO(2 — 1) and CO(l — 0) emissions, is low, suggesting low 
molecular densities. There is also a small gas-to-dust ratio, but correlated to 
low dust temperature, which might reveal only an excitation effect. Indeed, 
the comparison between H 2 mass obtained from CO emission and IRAS dust 
emission reveals that, at low dust temperature, the CO emission is not tracing 
properly the molecular content (Wiklind et al. 1995). 

Maps of the CO emission reveals that the gas is rotating in thin disks, that 
will form the typical stellar disks seen in many ellipticals. This gas does not 
come from mass loss of the parent stars, since it has a much higher specific 
angular momentum, but must have been accreted (Young 2002). 



Shells Around Ellipticals: The merging events giving birth to ellipticals 
are also forming shells. Shells are very thin stellar features, discovered by Ma- 
lin & Carter (1983) with the unsharp masking technique. Since shells always 
accompany mergers and interactions, they are likely to be the vestiges of the 
stars dispersed in the galaxy-collision event. Simulations confirm the scenario 
(Quinn 1984; Dupraz & Combes 1987). The stars of the small companion, 
disrupted in the interaction, oscillate back and forth in the potential well of 
the elliptical galaxy, and wrap in phase space such as to form several density 
waves at their apocenter, the “shells”. 

Recently, HI gas has been detected in shells (e.g. Schiminovich et al. 1994), 
while only stars where expected to follow the shells. Normally, the diffuse gas 
condenses to the center in the phase- wrap process (Weil & Hernquist 1993). 

But CO is now also detected in shells (Charmandaris et al. 2000, see 
Fig. 32). It is therefore possible that the HI gas is just coming from photo- 
dissociated molecular clumps, which themselves are so fragmented and com- 
pact, that they can follow nearly radial orbits throughout the center, without 
experiencing much dissipation. The phase wrapping process is thus still the 
preferred formation mechanism for these gaseous shells. 
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Fig. 32. {Left) Digitized Sky Survey optical image of Cen A with the contours of 
HI gas (from Schiminovich et al. 1994) superimposed in white. North is up and east 
is to the left, while the image scale is shown by the horizontal bar. The positions 
observed in CO are marked with the red circles whose size corresponds to the SEST 
44" beam of CO (1 — 0). The locations of the outer stellar shells are underlined by the 
yellow solid lines. The inner 6cm radio continuum lobes (from Clarke et al. 1992) 
are depicted by the blue contours. Note the jet alignment with the location of the 
CO detections. The outer radio lobes are far more extended. {Right top) CO(l — 0) 
and CO(2 — 1) spectra towards the northern shell SI with the temperature scale in 
main beam Tmb, {Right bottom) Same for the southern shell S2 (from Charmandaris 
et al. 2000). 



5.6 CO Emission in Tidal Dwarf Galaxies 

During major mergers between two spiral galaxies, long tidal tails develop, 
and it is frequent that the gas of the tails experience gravitational instabilities, 
and condenses in large complexes, giving rise to new stars: the tidal dwarf 
galaxies (e.g. Due & Mirabel 1994). CO emission has been detected in these 
small dwarfs (see Fig. 33), rather easily with respect to their small mass and 
therefore expected small metallicity (Braine et al. 2000, 2001). This supports 
the idea that these dwarfs have been formed out of recycled gas, coming from 
the giant galaxies, and therefore have a high metallicity for their mass. 

Is the molecular gas dragged with the tidal tail gas and recondenses in 
giant complexes in the tidal dwarf, or does the molecular gas itself re-form 
locally in the collapse? This might not be an important question, since in 
normal interstellar gas, the chemical time-scale to form molecules is much 
less than the dynamical time, and it is likely that the gas remains in part 




158 



F. Combes 




Fig. 33. (Left) The tidal dwarf galaxy in the interacting system Arp 105 (NGC 3561 
or “the Guitar”). The zoom on the dwarf galaxy in the South shows the HI con- 
tours superposed to the V-band image. The dark circle (at the position of the HI 
maximum) is the central position where the CO emission has been detected, and 
its size is 22” , the beam size of the IRAM-30 m telescope (from Braine et al. 2001). 
{Right) The tidal dwarf galaxy in the interacting system Arp 245 (NGC 2992/3). 
The HI contours are shown in the magnified image of the Northen dwarf. CO(l — 0) 
emission has been detected at and around the dark circle position, around the HI 
maximum (from Braine et al. 2001). 



molecular all along the tidal tails, even if it is more difficult to detect it, 
because of low density and low excitation. One puzzle is that the tidal dwarfs 
are most often located at the tips of the tidal tails detected in HI, while the 
simulations predict them all along the tails (Barnes & Hernquist 1992). 

What is the fate of these tidal dwarfs, and could they enrich the in- 
terstellar medium of peculiar metallic dwarf galaxies? In general, they are 
re-accreted and merge with their parent galaxies; the tidal tails themselves 
have been observed to infall back to the merger remnant (Hibbard & Mihos 
1995). 

5.7 Conclusions 

The CO emission in galaxies depends on morphological type, and has a be- 
haviour opposite to that of HI, but this could be only a metallicity effect. 

Galaxies can have large gas mass fractions, when they have low surface 
brightness, and are therefore stable. LSB galaxies are un-evolved, extended, 
un-concentrated systems, they may contain large quantities of H 2 but have 
low CO emission, because of their low metallicity. Their un-evolved state 
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might be due to their poor environment, or their large intrinsic spin param- 
eter. 

CO emission is a good tracer of density waves, spirals, bars, rings. It is 
often associated to the thin dust lanes, tracing large-scale galactic shocks. 

Its radial distribution is overall exponential, following the optical lumi- 
nosity. But there are large departures from the ideal exponential, contrary to 
stars, due to inhomogeneities and contrasted features. 

Elliptical galaxies contain cold H 2 gas, with a lower M(H 2 )/M(HI) ratio 
than in spirals. This is due either to different excitation, or a different CO/H 2 
conversion ratio. This gas is coming from accretion, there is no correlation 
with the stellar component of the parent galaxy. 

CO emission is a very useful tool to trace density, star formation, pertur- 
bations like warps, polar rings, gas dragged out of the spiral planes. 



6 Molecules in ULIRGs, and High Density Tracers 

6.1 Ultra-luminous Galaxies 

The role of large-scale dynamics and interactions of galaxies is now well rec- 
ognized for the starburst activity in galaxies (Kennicutt et al. 1987; Sanders 
et al. 1988). IRAS ultra-luminous galaxies are all mergers (Sanders & Mirabel 
1996), and the fraction of interacting galaxies is increasing with Lir/Lb- 
In addition, interacting galaxies appear to have more H 2 content or at 
least much more CO emission, than unperturbed and isolated galaxies. The 
H 2 gas is also more concentrated towards the galaxy nucleus. In average, 
the H 2 content is multiplied by 4 — 5 (Braine & Combes 1993) with respect 
to unperturbed systems, as shown in Fig. 34. This can be explained by the 
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Fig. 34. (Left) Central H 2 mass versus FIR luminosity, both normalised to Blue 
luminosity, to avoid size and distance biases. Averages for the isolated, close and 
disturbed galaxies (number of galaxies indicated) are plotted with error bars be- 
ing the uncertainty in the mean for each group. (Middle) Comparison of the two 
normalisations, by optical surface of the galaxy or Blue luminosity. (Right) The 
size-independent M(H 2 )— FIR relation, plotted with indication of the interaction 
state of the galaxies. The slope is nearly unity (from Braine & Combes 1993). 
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gravitational torques of the interactions driving gas very quickly to the cen- 
ters. This dense gas accumulation towards the circumnuclear regions can then 
trigger a starburst. The condition for a starburst is that the gas accumula- 
tion occurs in a time short enough that feedback mechanisms have no time 
to regulate the star formation. This condition is better fulfilled towards the 
nuclei where the dynamical time-scale is short (~ 10^ yr), and where the gas 
density can reach a maximum. 

There is more star formation, and more H 2 content in interacting galax- 
ies, but the star formation efficiency, which is defined by the ratio between 
star-formation indicators like the FIR or the Ha luminosity and the amount 
of gaseous fuel, is also larger (e.g. Rownd & Young 1999). The star forma- 
tion efficiency does not depend on Hubble type, but is sensitive to extreme 
variations in the galaxy environment. The star formation efficiency is lower 
for HI deficient galaxies in clusters, and it is enhanced in strong interactions 
and mergers. Could it be that the CO/H 2 conversion ratio is different in per- 
turbed galaxies? the gas is warmer and more excited, but it is denser too, 
and the conversion ratio roughly varies as Y oc and it is not obvious 

in which sense it is perturbed. If the derived molecular gas is lower, than the 
star formation efficiency SFE = Lfir/M(H 2 ) would even be larger. 



High End of the Luminosity Function: At luminosities above 10^^ Lq, 
infrared galaxies become the dominant population of extragalactic objects in 
the local Universe (z < 0.3). As shown in Fig. 35, they are more numerous 
than optically selected starburst, Seyfert galaxies and QSO at comparable 




Fig. 35. (Left) Variation of the mean SED (Spectral Energy Distribution from 
submillimeter to UV wavelengths) with increasing L/_r for a 60 /rm sample of in- 
frared galaxies. (Right) The luminosity function for infrared galaxies compared with 
other extragalactic objects: IRAS Bright Galaxy Survey (BGS), IRAS 1-Jy Survey 
of ULIRGs, Markarian Seyfert, Markarian starbursts, cDs, Palomar-Green QSOs 
and normal galaxies, from Sanders & Mirabel (1996). 
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bolometric luminosity (Sanders & Mirabel 1996). Their energy is essentially 
coming from a starburst, although a fraction comes also from an active nu- 
cleus, whose importance increases with total luminosity (Kim et al. 1998). In 
the majority of objects, the infrared luminosity is due to dust heating from an 
intense starburst within giant molecular clouds. At the highest luminosities 
Lir > IO^^Lq, nearly all objects appear to be advanced mergers, fueled by 
enormous concentration of molecular gas that has been dynamically driven 
into the circum-nuclear regions. 

Their SED (Spectral Energy Distribution) has the striking particularity 
that most of the energy is getting out in the infra-red. Since a lot of gas 
and dust are present, obscuration is high, and the optical light coming from 
star formation is absorbed, and is used to heat the dust. The energy of the 
starburst is then recycled, and re-radiated by the dust at lower frequency. 
Figure 35 shows how the infrared luminosity can vary by more than three 
orders of magnitude, when the optical luminosity only varies by a factor 2. 
The ratio Lir/Lb varies considerably, and is therefore an excellent indicator 
of starbursts. The brightest objects are the more obscured ones. The ratio 
between the IRAS fluxes at 60 and 100/rm, F60/E100 also increases with 
infrared luminosity: the brightest objects have a hotter dust, due to more 
star formation in a given volume. Recent HST observations of samples of 
ULIRGs have confirmed how perturbed and interacting they are (Borne et 
al. 2000). 

A remarkable feature is also the excellent correlation between the radio 
continuum emission at centimeter wavelength, and the far-infrared emission 
of galaxies. The only exceptions are the radio-loud AGN, where most of the 
radio emission comes from the active nuclei. The origin of the tight correlation 
must be that the star formation explains both emissions, the FIR coming 
from the dust heated by young stars, and the radio-continuum (RG) being 
synchrotron emission from supernovae. 

When the radial distribution of both emission are considered, than the ob- 
served tight correlation is revealed in the inner regions only where OB stars 
and the associated Type II supernovae control the FIR and RG emission 
(Mayya & Regaradjan, 1997). In the outer parts heating of very small dust 
grains by the old disk stars provides a secondary component in the FIR emis- 
sion, without associated RG emission. This explains also why galaxies with 
very weak star formation rate do not follow the relation so tightly. Another 
factor is the radial smoothing of the radio emission as a result of the propa- 
gation of energetic electrons accelerated during the supernova phase (Marsh 
& Helou 1995). The tight RG— FIR relation could also be used to deduce the 
mean magnetic held strength in galaxies, since the underlying mechanism for 
the correlation implies the linear correlation between the energy densities of 
the radiation held and the magnetic held (Lisenfeld et al. 1997). 
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6.2 Molecular Gas in ULIRGs 

These ultra-luminous galaxies have huge quantities of H 2 gas. The gas is dense 
and hot, with densities of the order of 10^ to 10^ cm“^, and dust temperatures 
of 60 — 80 K. These physical conditions are similar to what exists in star 
forming regions in Giant Molecular Clouds, like in the Orion complex for 
instance. But these regions in our Galaxy are very diluted spatially, while 
they are highly packed in starburst galaxies. 

The large sample of ULIRGs observed in Solomon et al. (1997), reveals 
a tight correlation between the CO and 100 /xm luminosities, suggesting that 
the dusty gas is close to the black body emission regime (see Fig. 37). The 
CO and dust emissions comes from very small sizes, since the gas giving rise 
to the starburst is highly confined in the circum-nuclear region, inside the 
central kpc. In the prototype Arp 200 for example, the main starburst is inside 
300 pc. Two disks are seen in the late stage of merging, as shown by the two 
components in the velocity dispersion, and in the 1.3 mm continuum coming 
from dust emission (Fig. 36). This explains the high optical thickness, and 
the near black-body regime. Usually in galaxies, the dust emission at 100 /xm 
is optically thin. However, the optical depth is varying with frequency as 
r oc , with ~ 2, and significant optical depth may begin at 60 /xm. 

Gas is concentrated in central nuclear disks or rings (Downes & Solomon 
1998). What is the stability of these central disks? If the Toomre criterion is 
computed with gas only, it is Qg slightly larger than 2, but taking into account 
stars also, the disk might be at the limit of stability, with a combined gas and 
stars criterion Q ~ 1. Due to the peculiarities of its physical condition, the 
ISM of interacting galaxies is likely to condense in large gaseous complexes, 
of the order of 10® Mq (Elmegreen et al. 1993). 

Interacting galaxies have gas with enhanced velocity dispersion, due to 
the intense dynamical perturbations: streaming motions, asymmetries, spiral 
arms, strong bars, etc. . . This has the consequence to increase the critical 
Jeans scale for gravitational instabilities, and to create giant complexes. The 
global Jeans length is A oc c^/Ug, where c is the velocity dispersion and A7g is 
the gas surface density of the galactic disk. The corresponding growth time 
is Tff oc c/A7g, and the instabilities will occur as soon as Q oc c/t/A7g becomes 
lower than 1. For the same ratio c/Ug, a perturbed system with elevated c 
and A7g will see the condensations of larger complexes of mass M oc / Sg, 
in the same time-scale r. These complexes with larger internal dispersions, 
and larger gravitational support will be less easy to disrupt through star- 
formation, which enhances the star-formation efficiency. The thermal Jeans 
length is also larger, due to the hotter gas temperature induced by the larger 
number of stars in the clouds, and high mass stars are favored. 

This may explain the formation of giant star clusters, as the Super Star 
Glusters (SSG) observed in starbursting galaxies (e.g. de Grijs et al. 2001). 
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Fig. 36. CO(2 — 1) emission map towards Arp 220 obtained with the IRAM in- 
terferometer with a 0.7 x 0,5 ’’beam. {Top left) Integrated emission, {Top right) 
Velocity field, {Bottom left) Velocity dispersion, {Bottom right) 1.3mm continuum, 
from Downes & Solomon (1998). 



Black Body Model: Due to the small sizes of nuclear gaseous disks in 
starbursts, the average column density reach easily A^(H 2 ) > 10^^cm“^, and 
therefore the optical depth of the continuum dust emission reaches t ^ 1 at 
100 ^m. 

In the extreme case of a black-body model, the far-infrared luminosity, 
for a dust temperature Td, of a nuclear region of size R, is: 

Lfir = 4^i?Vr4 , (44) 

where a is the Stefan-Boltzmann constant; there is no optical depth term, 
since it is assumed t ^ 1. In parallel, the CO luminosity can be expressed, 
in the Rayleigh- Jeans domain, at frequency v <C kT/h: 

Leo = 47r^R2(2fc/A2) J Ttdu , (45) 

the ratio of the two luminosities is then varying as 

Leo fv^V 



(46) 
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Fig. 37. {Left) Integrated CO(l-O) line intensity, I(CO), versus IRAS 100 /rm flux 
density, for the ultraluminous galaxies of Solomon et al. (1997). The solid line has 
a slope 1. {Right) Far infrared to CO luminosity ratio versus the blackbody dust 
temperature derived from the IRAS flux ratio S60/S100. The line indicates the 
upper limit for the luminosity ratio for the blackbody model, from Solomon et 
al. (1997). 



where fv is the filling factor in velocity, and AV the total velocity width. 

This relation corresponds to the predicted curve plotted in Fig. 37. The 
data points are slightly below this maximum curve, may be because the dust 
temperature Td is different than the brightness temperature of the gas T^, or 
because the CO and FIR are not emitted by exactly the same regions (the 
CO region could be larger) , or the velocity filling factor is small, reducing the 
optical thickness of the line. 



AGN or Starburst: According to mid-infrared line diagnostics or PAH 
abundances, about 80% of all ULIRGs are found to be predominantly powered 
by star formation (Lutz et al. 1998), but the fraction of AGN-powered objects 
increases with luminosity. Whereas only about 15% of ULIRGs at luminosities 
below 2 • 10^^ Lq are AGN powered, this fraction increases to about half at 
higher luminosity. 

In these ULIRGs containing such large molecular masses, it is inevitable 
that most of the luminosity comes from the starburst, since the black hole 
cannot swallow all the available gas, and nuclear disks are unstable to star 
formation. To give orders of magnitude, if the molecular disk has a radius 
of 500 pc, a rotational velocity of Uot = 300km/s, its period is lOMyr. As- 
suming a FIR luminosity of = 10^^ Lq, and a star formation rate of 

5OM0/yr, it needs 10 rotations (or 100 Myr) for half of the gas to be turned 
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into stars, the total H2 mass being 5-10® Mq. For this a radiation efficiency of 
M*/Lfir = S-IO^^Mq/Lq, or L/M ~ 200, has been adopted, corresponding 
to observations. 

On the other hand, if 10^^ Lq comes from accretion onto a black hole, at 
the efficiency of L = 0.1 {dm/dt)c^, the accretion must be only 1 MQ/yr, and 
therefore only 1% of the gas would be accreted on the same time-scale. The 
gas would remain available at 99% to form stars (Downes & Solomon 1998). 



Dynamical Triggering: Numerical simulations (e.g. Barnes & Hernquist 
1992) demonstrate the efficiency for gravity torques to drive the gas towards 
the center in a short time-scale. Mihos & Hernquist (1994) including star 
formation recipes like the Schmidt law in their simulations, show how eas- 
ily starbursts can be obtained in galaxy interactions, and sometimes several 
bursts occur, according to the orbit and pericentric distances. 

The self-gravity of the interacting disks play the fundamental role in 
galaxy interactions. Gas is driven to the nuclei, from bars and spirals trig- 
gered in its own disk by the perturbation. This explains why the mass ratio 
between the bulge and the disk is a more fundamental parameter than the 
geometry of the encounter (Mihos & Hernquist 1996). 

The effects of the geometry are more visible on the direct manifestations of 
the tidal interaction, i.e. on the tails and debris. For the galaxy experiencing 
a retrograde collision, no extended tidal tail is formed. There is no resonance 
effects in the target galaxy disk, and less violent material perturbations; a 
transient leading tidal arm is instead developed. There is however a tidally- 
induced two-arm density wave, and the subsequent torques produce gas inflow 
towards the center, as for the prograde encounter. 

6.3 High Density Tracers 

Observing the wide-spread CO(l — 0) line gives an overall view of the molec- 
ular content of galaxies, but many aspects of the gas would be overlooked if 
other molecules, other isotopes, or other J levels were not explored as well. 

Nuclei of galaxies possess denser gas than the rest of the disk; in general, 
giant molecular clouds must be denser to survive to the larger tidal forces. 
High- J levels of the CO molecule have a higher critical density to be excited, 
10^cm“^ for the CO(3 — 2) transition, 10‘*cm“^ for CO(2 — 1), while only 
10^cm“^ for CO(l — 0). Other high density tracers are the larger dipole 
molecules, such as HCN, HCO+, CS, CH3OH, H2CO, OCS, etc. . . Some 
molecules trace peculiar regions, according to their formation mechanism; 
SiO traces shocks, supershells in starbursts for example. 

The observation of isotopic molecules may highlight optical thickness, 
excitation problems, and also chemical evolution: primary or secondary ele- 
ments can trace the age of the star formation events (as the and for 
instance). 
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A laboratory for such studies is the nearby starburst galaxy M 82: maps of 
all high J-levels of CO have been performed (e.g. Mao et al. 2000). Images at 
high-J are roughly similar in morphology although somewhat less extended 
than CO(l — 0). Two hot spots on either side of the nucleus are always con- 
spicuous in each line, due to the edge-on nuclear ring, in this barred galaxy. 
Inside the ring, the ISM is depleted, this could be due to the bar torques, and 
also to the violence of the starburst, that expelled hot gas in a galactic bipolar 
flow. Radiative transfer has been computed with a LVG (Large Velocity Gra- 
dient) model, and the typical physical conditions are fV(H2) ~ 10^^cm“^ in 
column density, M(H2) in the central region of a few 10 ® Mq, n(H2)10"^ cm“® 
in average volume density, close to the tidal limit. The CO emission in the 
high-J levels comes primarily from PDR (photon-dominated regions), which 
is quite different from the other high density tracers. At least two compo- 
nents in the molecular gas are required to interpret the data: dense cores and 
a more diffuse medium, or intercloud. The diffuse component intervenes in the 
CO(l — 0) emission, also explains the high CI/CO ratio (Stutzki et al. 1997). 
This might not be representative of starburst at high redshift. Kinetic tem- 
peratures are derived between 20 — 60 K, relatively low for a starburst. The 
heating is due to star formation, cosmic rays and turbulence dissipation. 
This result is consistent with the weakness of CH3OH or SiO, which are high 
temperature tracers. 

The SiO molecule has been mapped in M82 by Garcia-Burillo et al. 
(2001). SiO traces the walls of the supershells, related to the bipolar outflow, 
but not the star forming regions. There is in particular a vertical filament, 
part of an SiO chimney coincident with radio cm emission, and ionised gas 
ejected by the starburst. The outflow creates shocks, in which SiO is formed. 

As for the excitation of the GO gas, the observations combined with radia- 
tive transfer analysis (cf. Fig. 38) are consistent with an optically thick gas. 
The intensity ratio between the two isotopic species ^^GO/^®GO is between 
10 and 15 (while the abundance ratio is around 60). 

Mauersberger et al. (1999) have carried out a survey of GO(3 — 2) in 30 
spiral galaxies. The intensity ratio i?(3 — 2/1 — 0) is between 0.2 — 0.7, always 
lower than 1. This is what is predicted if T^in < 50K and n(H2) < 10®cm“®. 
Although dense cores should have higher values, they are relatively diluted 
in the averaging on galactic scales. This is not the case in the nuclei of ultra- 
luminous starburst galaxies, where the dense gas dominates. 

GS and HGN are high density tracers even at low temperatures. The ratios 
GS/GO and HGN/GO are correlated with far-infrared luminosity LpiR. While 
the ratios are around 1/6 in ULIRGs, they are only 1/80 in normal galaxies 
and in the Milky Way. The ratios depend on the spatial resolution, however, 
since they are larger in dense cores. The large-scale morphology is the same as 
in GO (spiral arms, rings, etc. . . ). Strong starbursts have a larger fraction of 
dense gas, as shown on Fig. 39. Not only the HGN luminosity increases with 
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Fig. 38. {Left) CO(l — 0) (dash) CO(2 — 1) (solid) and CO(3 — 2) (histogram) 
profiles toward the center of M 82. All spectra were convolved to the same spatial 
resolntion of 21”; the temperature scale is main beam Tmh, from Mauersberger et 
al. (1999). (Right) Peak line temperatures as a function of upper rotational number 
J for a beam size of 22”, towards the center of M82. The curves are model results 
from radiative transfer calculations, and are compared to the data points, from Mao 
et al. (2000). 




log(LHCN) [K km s ‘ pc^] 




Fig. 39. (Left) Correlation between FIR and HCN luminosities for the Milky Way, 
luminous and ultra- luminous galaxies. The solid line has a slope of 1. (Right) FIR 
versus HCN luminosities, normalised to CO luminosity. The HCN/CO ratio mea- 
sures the fraction of dense gas, and is larger for starburst galaxies, since the FIR/CO 
ratio is a measure of star formation efficiency, from Solomon et al. (1992). 



-^FiR, but the HCN/CO ratio, measuring the fraction of dense gas, increases 
with normalised Afir- 



Isotopic Molecules: The ^^C/^^C abundance ratio in the Milky Way, is 
estimated between 50 — 90 at the Sun radius. There is a measured radial 
gradient in the galaxy, and the ratio tends towards 10 — 20 in the center. 
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This abundance ratio is a tracer of astration, since the is a secondary 
element, formed by low-mass stars (it takes a time-scale of 1 Gyr to enrich it), 
while the is primary, formed at once in massive stars, with a time-scale 
less than 100 Myr. In the Galactic Genter, with high astration level, there 
is also a deficiency of deuterium, and this has to be taken into account in 
observations of DGN, DGO+, etc. . . 

In starbursts and ULIRGS (of Arp 220 type), the ^^GO/^^GO intensity 
ratio is larger. This is not due to a low optical depth, since the G^®0 is normal 
with respect to ^^GO. The right interpretation is that ^^G is overproduced 
in the nucleosynthesis of a recent burst (Gasoli et al. 1992). The ^^G/^^G 
abundance ratio has been mapped in M 82 and IG 342 by Henkel et al. (1998) 
from GN, HGN, HGO-I- observations. The ratio is always ^^G/^^G > 40 (much 
higher than in the Galactic Genter) . In a similar way, the oxygen and nitrogen 
isotopic ratios have been estimated > 100 and > 100. 

HG^^N has been detected in LMG and NGG4945 by Ghin et al. (1999) 
who derived the abundance ratio = 111, lower than in the Milky 

Way. They deduce that is synthesized by massive stars, and is a primary 
element. There is some controversy about formation, at least with respect 
to standard stellar evolution, and nucleosynthesis calculations since it should 
be destroyed in massive stars, instead of being formed in SN-II explosions. 
Molecular results are however consistent: is more secondary, and the ratio 

increases with time and astration. 

Deuterated species have been detected in the LMG: DGN, DGO+ (Ghin 
et al. 1996). The ratio HGN/DGN is of the order of 20, because of strong frac- 
tionation. Although the deuterium abundance is low, D/H = 2- 10“®, deuter- 
ated molecules have lower energies in the ground state then the corresponding 
hydrogenated isotope. At the low temperatures of the ISM, reactions of the 
type: 

HD -b HGN ^ Ha -b DGN , (47) 

are shifted towards the formation of DGN, since the deuterium in molecules 
is a trace with respect to the reservoir of HD. 



The HNC/HCN Ratio: Useful to disentangle abundances, excitation, den- 
sity or temperature, HNG (the hydrogen isocyanide) is a high density tracer 
as well. 

HNG emission is weaker than HGN, except in ULIRGS such as Arp 220, 
where the intensity ratio is > 1. The reason is not very clear however, since in 
NGG 6240 (another prototypical starburst), it is 10 times lower. The compar- 
ison between HNG and HGN, and also HNG and HGO+ is plotted in Fig. 40 
(see Hiittemeister et al. 1995). 



Other Molecules: Other molecules are useful to trace different physical 
conditions. OGS is detected in NGG 253 and M 82 (Mauersberger et al. 1995); 
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Fig. 40. (Left) HNC versus HCN integrated intensities, for starburst galaxies {as- 
terisks), and normal galaxies {filled circles). The dash line indicates equal intensi- 
ties, and the solid line is the better fit. {Right) Same, for HNC versus HCO'*', from 
Hiittemeister et al. (1995). 



NH 3 in Maffei2 and other galaxies (Henkel et al. 2000), with rotational tem- 
peratures of 85 K. H 2 CO (formaldehyde) and CH 3 OH (methanol) in NGC 253 
trace high-density subthermally excited, clumpy structure. All these studies 
point out very different physical conditions and various chemistry, from one 
galaxy to the next (Hiittemeister et al. 1997). 



Atomic Carbon: The Cl fine structure line ^Pi-^Po at 492 GHz is an im- 
portant tracer of non-ionising radiation. In Arp 220, the Cl line is strong, 
as predicted from its FIR flux, while CH (at 158 /im) emission is depleted. 
This could be due to higher density, optical thickness of the C’*' line and dust 
opacity, in the far-infrared. In a sample of about twenty galaxies detected 
in Cl, the intensity ratio has been determined to be CI/CO = 0.2 (Gerin & 
Phillips 2000). The cooling efficiency is comparable for CO and CL Normally 
C’*' is a better coolant than Cl, except in Arp 220 and Mkn231. 

6.4 Conclusions 

The molecular component has a much higher mass fraction in starbursts and 
ULIRGs than in normal galaxies; it is not the case for the HI component. 
It explains the considerable enhancement in star forming efficiency. This H 2 
enhancement is not only due to CO emission enhancement because of gas 
excitation, density or temperature, since all gas density tracers confirm the 
large H 2 abundance and density. 

In interacting galaxies and mergers, gravity torques produce efficient gas 
inffow from the external parts of galaxies towards the center. Large quantities 
of atomic gas can then be transformed in H 2 . Ultra- luminous galaxies have 
dense nuclear disks, concentrating the CO and dust emission; their radiating 
conditions are approaching the black-body regime. 
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Only in strong starbursts is the H 2 gas dense enough to emit sufficiently 
high-J CO lines. This has important consequences for high redshift galaxies, 
since they can be observed only with these lines, redshifted. 

Various molecules help to constrain the physical conditions of the molecu- 
lar gas: density, temperature, excitation, shocks, dumpiness, chemical abun- 
dances. At least two components are required to interpret the observations: 
hot dense cores where stars form, embedded in an intercloud gas, a more 
diffuse medium, likely to be subthermally excited. 



7 Molecules Traced in Absorption 

7.1 Advantages of the Absorption 

Absorption lines are a sensitive probe for studying the interstellar medium. 
Especially in distant objects, where emission lines becomes diluted with dis- 
tance squared, whereas the detectability of absorption lines only depends on 
the observed flux of the background source. This is well known from op- 
tical spectroscopy, where the combination of sensitive detectors and large 
telescopes allow observation of very tenuous gas towards distant background 
QSO. 

In principle, absorption of molecular rotational lines can be used to probe 
the densest and coldest part of the ISM in distant galaxies, much in the same 
way as optical lines probe the warm and diffuse gas. Since new stars are 
formed in molecular clouds, a study of this ISM component trace the star 
formation conditions and its history in galaxies. There are, however, several 
difficulties associated with the detection of such lines and it was not until 
1993 that the first distant molecular absorption line system was detected. 
Since then a total of four such systems at redshifts between z = 0.2 — 0.9 
have been observed. 

Absorption measures are very useful, in particular in the Galaxy, when 
both emission and absorption can be detected along the same line of sight. 
Then it is possible to obtain the physical conditions, temperature T and 
column density N, with a coupled system of two equations linking these 
two quantities, while absorption and emission alone can provide only one 
equation. 

One of the fundamental advantage of absorption studies is to benefit of 
spatial resolution due to the pencil beam towards the background continuum 
source. The QSO size in the millimeter domain is particular small (tens of 
fxas), since only the core of the radio source is emitting, and not the radio 
lobes (as at cm wavelengths). 

However, one must be aware that absorption is biased towards detecting 
cold gas, contrary to emission (e.g. Combes & Wiklind 1996). In the Rayleigh- 
Jeans domain, simple radiative transfer on the line of sight relates the signal 
received, in antenna temperature TJ, to the excitation temperature Tex of 
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the observed medium, the background radiation T\^g, and the optical depth 
r: 

Tl = (Tex-Tbg)(l-e-0 . (48) 

Emission is detected when Tex > Tbg. For the atomic line HI at 21cm for 
instance, there is a large influence of the stimulated emission (“negative ab- 
sorption”) since the AT between the two hyperflne levels F = 1,0 of the 
H-atom ground state is only ~ 0.7 K. The optical depth of the T = 1 level is 
related to the total HI column density iV by a relation: 



r N 
T av (X — , 
T ’ 



(49) 



where the optical depth has been integrated over the whole velocity profile. 
In general, the atomic gas is optical thin, so that r <C 1, and therefore the 
equation, together with the transfer equation, simplifies to: 



N oc 



J TexT dv OC 



jTAdv 



(50) 



for HI emission, since the background temperature Tbg has been neglected. 
The column density is then independent of temperature. On the contrary, 
for absorption, the background temperature is equal to the powerful radio 
source, in front of which Tex is negligible, and therefore the absorbing signal 
has an amplitude equal to Ta = — Tcont^ in the line. The optical depth and 
therefore the intensity of the absorption varies as 1/T for a given column 
density. This is a strong bias to see cold gas in absorption. 

In the millimeter range, the molecules have a whole ladder of rotational 
levels, the emission then depends on temperature since the column density in 
the emitting (upper) level is a function of the total column density and 
the rotational (excitation) temperature: 



Nu 

Ntot 



Z 



^-EJkT ^ 



(51) 



where is the statistical weight of the upper level, and Z the partition 
function. The upper column density is, for emission: 



if r <C 1, and 



For absorption: 



A^u oc J T^y^r dv (X J TAdv ^ 


(52) 


Ntot oc TfVue®“/'=^ . 


(53) 


[Tdv(x ^ , 


(54) 



also strongly weighted by the temperature Tex. 
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Since collisional excitation requires 4- lO"* cm“^ for CO, and 1.6- 10^ cm“^ 
for HCN, even in hot (in terms of kinetic temperature) but diffuse media, 
the excitation temperature is very low, and tends to Tex = Tbg = 2.76 K. 
Therefore, molecular absorption is weighted towards detecting the diffuse 
media. 

The utility of molecular absorption lines is also to detect all high density 
tracers but this time in diffuse media, and therefore explore their spatial 
extension. Due to their high dipole, these molecules populate more their 
ground levels, and are not detected in emission in diffuse media. Their velocity 
integrated opacity is directly proportional to the square of the permanent 
electric dipole moment. This means that a molecule like HCO+, which is 
^ 10“"^ less abundant than CO, but has a dipole moment ~ 10^ times larger 
than that of CO, can be as easily observed. 

Finally, one of the main advantage of absorption is that, due to the small 
extent of the background continuum source, the signal detected will be only 
proportional to the opacity, which is not spatially diluted. Once the back- 
ground QSO is detected in the continuum, there is no distance dependence 
to detect the absorption line. Molecular absorption lines are as easy to detect 
at z « 0 as at 2 « 1, except that at small distances, emission can make ab- 
sorption line measurements more difficult. The following will illustrate this, 
by showing HCO'*', HCN and CS absorption from CenA at a distance of 
~ 4Mpc (Wiklind & Combes 1997a) and from PKS 1830-211 at a distance 
of ~ 4000 Mpc (Wiklind & Combes 1996a). Both set of spectra have been 
obtained with similar integration time. 

7.2 Molecular Absorption in the Galaxy 

Absorption lines in the millimeter is more difficult to observe than in the cm 
range, since continuum sources are weaker (the flux is going as S (x v~°‘) 
and smaller. They are also more difficult to observe than at high redshift, 
since there is the confusion with emission. It is only with the advent of mm 
interferometers that the tiny absorption regions have been separated from 
emission. The literature is thus relatively recent, beginning with the molecular 
absorption in the Milky Way detected in front of BLLac by Marscher et 
al. (1991). 

When looking at all possible background source to detect molecular ab- 
sorption in the Milky Way, one realizes the small filling factor in surface of 
the molecular component even of the diffuse CO medium: only 9% of the 3C 
sources observed have CO emission in front of them (Liszt & Wilson 1993). 
Among these line of sights, 60% show absorption lines. 

The surprise of the first observations is that many molecules were de- 
tected, even towards very diffuse media, with extinction of only Ay ~ 1 mag: 
already a very abundant chemistry exists at very low column densities, which 
was not expected (Lucas & Liszt, 1994). A variety of lines, such as ^^CO, CN, 
HCO+, HCN, HNC, C2H, N2H+ are reported with line ratios quite variable 
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Fig. 41. CN, HNC, HCN and HCO’^ absorption spectra toward B 0355+508. CN 
and HCN have well separated hyperfine components. The HNC spectrum, in which 
the hyperfine structnre only widens and does not create new components, helps 
to distinguish which of the HCO+ features are present in the CN spectrum (from 
Liszt & Lucas 2001). 



from one line of sight to the other (Lucas & Liszt 1994, 1998). Figure 41 
illustrates the richness in molecules (Liszt & Lucas 2001). Among a survey of 
30 lines of sight, HCO'*' absorption is detected 30% as often as HI absorption, 
and even more frequent than CO (Lucas & Liszt 1996). 

The bigger surprise is the strength of HCO'*' absorption, in these diffuse 
media; there is not only the higher critical density to see HCO^ in emission, 
explaining why HCO+ is so “cold”, but the chemistry has to be revised to 
be consistent with observations. Some lines are very optically thick (^^CO is 
detected) and for others t <C 1, and the hyperfine lines of HCN are observed 
in the canonical ratio 5:3:1 as expected. The lines are quite narrow, of the 
order of AV = 0.5 — 1 km/s. 

Absorption lines are sometimes variable over a year time-scale, suggesting 
the presence of moving clumpy material, of sizes 10 — 100 AU in front on 
the continuum source (Marsher et al. 1993). Also spatial fluctuations in the 
chemistry only have been invoked (Liszt & Lucas 2000). 

To explain the large variations from region to region of the abundances of 
CO versus HCO+, variable by factors up to 20, several propositions have been 
studied; one of them is the chaotic chemistry, where only a tiny difference in 
initial conditions can change completely the final result. In certain conditions, 
there are double steady state chemical solutions, between which the medium 
experiences bi-stability (Le Bourlot et al. 1993). Two phases can then be 
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Fig. 42. (Left) Variation of and column density with HCO+. The 

slopes of the regression lines are 1.4 for and 1.6 for ^®CO. (Right) CO and H 2 
column densities determined in the UV, taken from the compilation of Federman 
et al. (1994, revised in 1995). The slope of the regression line is 2.02. (from Liszt & 
Lucas 1998). 

found, either low ionisation or high ionisation for instance, and very close 
regions in space could end up in one or the other solution, randomly. 

Another proposition is that CO can form rapidly from HCO^ in diffuse 
clouds. H 2 can form at relatively low density. Whenever H 2 is there, HCO+ is 
present with a minimum abundance HCO“''/H 2 = 2-10“®, and then CO forms 
by recombination of HCO'*". There is evidence of a rather sudden CO turn on, 
when the column density of H 2 crosses the value 10^° — 10^^ cm“^, as shown 
in Fig. 42. The fact that HCO+ is much more extended than CO is supported 
by its good linear correlation with OH: A(HCO+) = 0.03 — 0.05A(OH), even 
at low column density. 

The CO molecule then forms later, at higher column densities, when C^ 
recombines. Eventually, diffuse clouds have similar chemical abundances as 
in dark clouds, except for the CO molecule. N(CO) increases abruptly when 
N(HCO'*') reaches 1 — 2-10^^ cm“^. The correlation between these two column 
densities observed in absorption reveals a non-linear slope of 1.5. Already, by 
UV absorption measurements, Federman et al. (1994) had found a non-linear 
relation, with a power-law slope of ^ 2 between CO and H 2 column densities 
(cf. Fig. 42). This may explain the observed line-width of HCO+ spectra, 
always larger by at least 15% than the CO line-width: the HCO'*' is like 
an envelope around the dense molecular clouds, where CO is absent. Sur- 
prisingly the ^^CO absorption line is relatively strong. Fractionation, much 
more efficient than selective photodissociation, is certainly the cause, through 
reactions like: 



12cO-|-13c+ ^ i3cO-b^^C+ 



(55) 
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7.3 Extragalactic Molecular Absorptions 

When the line of sight of a radio-loud QSO crosses a galaxy, and also a 
molecular cloud (a quite rare event), absorption lines are detected in the 
millimeter and centimeter rotational lines of many molecules. 

These molecular absorption objects are the continuation at high column 
densities (10^^ — lO^'^cm”^) of the whole spectrum of absorption systems, 
from the Lyman-a forest (10^^ — 10^® cm“^) to the damped Lyman-a and HI 
21cm absorptions (10^® — 10^^ cm“^). It is currently thought that the Lyman- 
a forest originates from gaseous filaments in the extra-galactic medium, that 
the damped and HI absorptions involve mainly the outer parts of spiral galax- 
ies. The molecular absorptions concern the central parts of galaxies. These 
molecular events are much rarer, the number N versus the column density 
decreases as a relatively steep power law. 



Comparison with Emission: The absorption technique is much more sen- 
sitive than emission, as soon as a molecular cloud is found in front of a 
background radio QSO. At high redshift for instance, the detection limit is 
10^° Mq, while the absorption limit does not depend on redshift, and can 
be as low as IMq, depending on its column density. The absorption limit 
is in optical depth, the background source in general is highly diluted, but 
its brightness can reach up to 10^^ K, while the flux detected is less than 
1 Jy (or less than 100 mK with current telescopes). If for galactic studies, 
interferometric resolutions are required, since absorption is generally buried 
among strong emission of local molecular clouds, single dish observations are 
sufficient for all extragalactic studies. The nearest absorption is Centaurus A, 
where both emission and absorption are detected with the same amplitude. 



Centaurus A: The absorption was first detected by Eckart et al. (1990) in 
CO and HCO”^. Many other lines are detected in absorption only (see Fig. 43; 
Wiklind & Combes 1997a). After about 10 years, no temporal variation is 
detected. This helps to put constraints on the source size, which must be 
larger than 500 AU. The analysis of all the lines show that the gas is in 
average of low density, low excitation and low kinetic temperature. Lines are 
optically thin, and there is a particularly wide absorption in HCO'*’, which 
could correspond to a nuclear disk (Wiklind & Combes 1997a). 



Higher Redshift Absorptious: The first high -2 absorption was detected 
towards the BLLac object PKS 1413-1-135 (Wiklind & Combes 1994), af- 
ter many unfruitful searches towards DLA (Damped Lyman-a Absorptions). 
Since then, 4 systems are known, but they remain rare. Half of them are 
gravitationally lensed objects, and true intervening absorption, PKS 1830- 
211 and B 0218-1-357. The absorbing molecular clouds belong to the lensing 
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Fig. 43. Absorption lines towards Centaurus A of HC0^(1 0), H^®CO'*'(l — 0), 

HCN(l-0),HNC(l-0), CS(2-1) and N2H+(l-0). Only N 2 H+ remains undetected. 
The spectra have been normalized to a continuum level of unity (from Wiklind & 
Combes 1997a). 
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galaxy. This is a way to find molecules in normal galaxies at high redshift 
(not only in ultra-luminous galaxies). 

Redshifts range up to z ~ 1, while the background QSO up to z ^ 2; 
it is difficult to find higher redshifts QSO, that are strong enough in the 
millimeter. Large variety of line widths, optical depths, sometimes several 
components, at different velocities, are detected along the same line of sight. 

In addition to the four detected molecular absorption line systems, a sys- 
tematic survey was carried out of about a hundred candidates, selected from 
flat-spectrum continuum sources. The continuum needs to be at least 0.2 Jy 
to allow detection of intervening molecular gas. The redshift of the absorb- 
ing candidate is known, either from previously detected HI absorption; 21 cm 
or damped Lyman-a, which is the case for PKS I4I3-I-I35 and B 02I8-I-357 
(Carilli et al. 1992, 1993; Wiklind & Combes 1994, 1995) or from optical lines 
emission of a galaxy on the line of sight to a radiosource; B 1504-1-377 (Wik- 
lind & Combes 1996b). When the continuum source is strong enough, at least 
1 Jy, and no redshift is known, it is possible to search for absorption lines by 
scanning in frequency in a manner similar to what was done for PKS 1830-210. 
This last method is the most promising with the new generation millimeter 
instruments, that will gain an order of magnitude in sensitivity. Indeed, the 
best candidates are the most obscured ones, where no redshift is available. 
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Fig. 44. {Left) A comparison of the CO(l — 0) and 21cm HI absorption towards 
PKS 1413+135. In both cases the optical depth is plotted. (Middle) HCO^(2 — 1) 
absorption lines at two different epochs. (Right) Normalized spectra of CO(l — 0) 
and HCO'^(2 — 1) both obtained in May 1994. Note that the relative intensities of 
the two velocity components have varied (from Wiklind & Combes 1997b). 



PKS 1413+135 at z ^ 0.247: This is a particularly narrow absorption, 
with two components, each less than 1 km/s wide. There has been for a long 
time a debate to know whether the continuum source is a background object, 
or is the central AGN in the galaxy. The position of the source coincides with 
the center, and if it was located much behind, a gravitational deformation 
should occur, which is not the case. Lamer et al. (1999) conclude that the 
galaxy at z = 0.247 is indeed the host of the BLLac. The object is highly 
obscured, and appears a spiral galaxy edge-on. The radio source, as well as the 
optical AGN are highly time-variable. The GO line is optically thick, which 
corresponds to what has been deduced in the optical, N(H 2 ) > 10^^cm“^, 
Ay > 30 mag McHardy et al. (1994). 

Some temporal variability has been detected, as shown in Fig. 44, and 
might suggest small-scale structure. The opacity ratio between the two com- 
ponents has varied by 2.3 over 2 years (Wiklind & Gombes 1997b). The 
variability is seen in the GO, not in HGO+. GO is more optically thick, and 
HGO^ belongs more from the diffuse component. 

The variations could be due to the line of sight change due to the variabil- 
ity of the continuum source. This source, as a BLLac, is likely to experience 
superluminal motions. The radio core is unresolved at 2.3 mas or 7pc. As- 
suming a size of the order of lOy^as = 0.03 pc, the proper motions of small 
molecular clumps in front of the background sources, at 250 km/s = 50 AU/yr 
are insufficient to explain the observed variability (time-scale of 100 yr would 
then be expected). Velocities larger than 25 000 km/s are required. The mo- 
tion must come from the core itself. 

The interpretation of the physical conditions of the gas is difficult. The 
data are compatible with either a multi-component model with similar filling 
factors, or with dense clumps embedded in a diffuse medium. The diffuse 
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component accounts for most of the absorption, while the clumps comprise 
most of the mass. 

Because of the very narrow velocity width, the cloud along the line of 
sight must be quite small, of the order of Ipc according to the size/line- 
width relation followed by molecular clouds. The density is estimated n(H 2 ) ~ 
10"^ cm“^. 

B 1504+377 at z ^ 0.672: Seven different velocity components are de- 
tected in absorption, in front of the continuum source, which is local to the 
galaxy. The large separation of 330km/s is surprising for a line-of-sight pass- 
ing by the nucleus, but this suggests the presence of a nuclear ring, with non- 
circular motions, and a forefront spiral arm. The abundance ratio HNC/HCN, 
which is sometimes used as thermometer, suggests that the kinetic temper- 
ature is equal to the excitation temperature, therefore indicating a dense 
medium. 

HCO’*' is enhanced by 10 — 100, as in the Galaxy; there must therefore 
exist both the diffuse and dense media on the line of sight (Wiklind & Combes 
1996b). 



B 0218+357 at z = 0.685: This is one of the smallest separation double 
source gravitational lens. One of the two images is surrounded by an Einstein 
ring visible in the radio continuum with VLBA measurements (Patnaik et 
al. 1995). The image separation is 335 mas (1.8 kpc). 

This is the absorption system with the largest column density > 10^"^ cm“^ 
(Wiklind & Combes 1995). All three CO(2 — 1) isotopic lines are optically 
thick, as shown in Fig. 45. This was an excellent opportunity to search for 
molecular oxygen in interstellar space. This crucial molecule has not yet been 
detected in the Milky Way, in part because large telescopes on the ground 
are prevented by the atmospheric lines; yet O is more abundant than C, 
and chemical models predict a large abundance for O 2 . Because of its large 
redshifts, the main O 2 lines can be observed from the ground in the B0218 
molecular clouds, without atmospheric absorption. The result of the search is 
an upper limit of O 2 /CO < 2 • 10“^ at 1 ct (Combes et al. 1997). This is much 
below theoretical expectations and suggests that oxygen is frozen on to dust 
grains, or that steady-state chemistry is never reached. In the latter case, 
most of the oxygen would remain atomic even in dense molecular clouds. 

In a similar way, it is not possible to observe from the ground the fun- 
damental line of water, and to get insight on its spatial extension. Only 
the isotopes and HDO have been detected, but in dense cores, with a 

deduced H 2 O/H 2 abundance ratio between 10“^ and 10“®. It was thought 
until recently that these abundances concerned only the neighbourhood of 
star-forming regions, such as the Orion hot core, where water ice is evapo- 
rated from grains. However, Cernicharo et al. (1997) detected with the ISO 
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Fig. 45. {Left) Normalised spectra of the three isotopic molecules ^^CO, ^^CO, 
C^®0, in the J = 2 — 1 line towards the gravitationally lensed quasar B 0218+357. 
The three lines are optically thick, meaning that the column density of H 2 is of the 
order of 10^^ cm“^ or more (from Combes & Wiklind 1995). {Right) Spectrum of 
ortho-water in its fundamental line at 557 GHz, redshifted at 331 GHz, in absorp- 
tion towards B 0218+357. The line has the same width as the previously detected 
HCO'^(2 — 1) and CO(2 — 1) lines. Spectra have been normalised to the continuum 
level completely absorbed (from Combes & Wiklind 1997). 



satellite water in absorption at 179 /i in front of SgrB2, and this revealed 
that cold water was ubiquitous. 

The detection with the IRAM-30 m telescope of ortho-water in its funda- 
mental line at 557 GHz confirms this result (Combes & Wiklind 1997). The 
line is highly optically thick, and has about the same width as the other 
optically thick lines detected in absorption in this cloud (see Fig. 45). If the 
excitation temperature was high (as in the Orion hot core), we would have 
expected to detect also the excited line at 183 GHz (redshifted at 109 GHz). 
An upper limit on this line gives us an upper limit on Tex of 10 — 15 K, and 
an estimation of the optical depth of the 557 GHz line of ~ 40 000 (Combes 
& Wiklind 1997). This leads to an H 2 O/H 2 abundance ratio of 10“®, in the 
upper range of expected values. 

It was also possible to try the detection of the LiH molecule at 444 GHz, 
again never detected until now, although fundamental for primordial cloud 
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Fig. 46. {Left) The gravitationally lensed quasar PKS 1830-211 consists of two im- 
ages, embedded in an Einstein ring, as shown in this cm radio continuum map (from 
Jauncey et al. 1991). {Right) Results of the fit of visibility data from April 1996 
observations: the integrated spectrum is at bottom, and the phase-center position 
in right ascension at top: this shows that the V = — 147.5kms“^ component is 
absorbed in front of the NE image (from Wiklind & Combes 1998). 



and galaxy formation. A tentative detection of a very narrow line has been 
reported (Combes & Wiklind 1998). The LiH abundance would be L1H/H2 
3 - 10 - 12 . 



PKS 1830-211 at 2 ; = 0.88582: This gravitational lens reveals two images, 
embedded in an Einstein ring (see Fig. 46). Two velocity components are 
detected, and interferometric data show that each velocity component absorbs 
in front of a different gravitational image (Wiklind & Combes 1998). 

The measurement of a few dozens of molecular lines in this source bring 
some insight in the evolution of chemical conditions with redshift. The various 
line ratios that have been obtained in the many absorptions systems show 
that there does not seem to be variations vith redshift, but there is a large 
scatter, even at z = 0 (Lucas & Liszt 1994, 1996). 



Monitoring of Temporal Variations: The fact that at a given velocity, 
only one image is covered by a molecular cloud, allows to know the flux ratio 
between the two images, even with a single dish, without resolving the source. 
This has been used to monitor the temporal variability, and measure a time- 
delay between the two images, to deduce a value of the Hubble constant 
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Fig. 47. {Left) Measurements of the Cosmic Microwave Background Radiation tem- 
perature at various redshifts (from Srianand et al. 2000). The asterisk is the mm 
constraint obtained from PKS 1830-211, at 2 = 0.88582. {Right) Variation with time 
of the fine structure constant a. Comparison of the mm/HI constraints {crosses, 
from PKS 1413-1-135 at 2 = 0.247 and B 0218-1-357 at 2 = 0.685) which do not show 
any variation, with the recent optical constraints {star and dots) (from Murphy et 
al. 2001). 



(Wiklind Combes 2001). At the same time, the monitoring carried out over 
3 years, with one observation of Ih every week, has revealed evidence for 
micro-lensing by either stars or molecular clouds in the lensing galaxy. 



Measure of Molecular clouds are usually very cold, with a kinetic 

temperature of the order of 10 — 20 K. The excitation temperature of the 
molecules could be even colder, close to the background temperature Tbg. 
This occurs when the absorption involves diffuse gas, which is the case in 
front of PKS 1830-211 where Tex ~ Tbg for most of the molecules. This yields 
one constraint for the measurement of the cosmic background temperature as 
a function of 2 . Other measurements have been recently obtained at 2 = 2.34 
from H 2 , Cl and C+ lines, by Srianand et al. (2000), see Fig. 47. 



Variation of Constants: Finally, the large spectral resolution (up to i? = 
10®) reached in millimetric spectroscopy, and the narrowness of the molecular 
lines make it possible to improve the constraints on the variation of the fine 
structure constant a = ^ j he with space and time. 

The method of alkali doublet in optical has been refined to include many 
multiplets, and a positive result has been reported recently by Webb et 
al. (2001). Only a variation at high 2 is detected. Comparison of HI 21-cm ab- 
sorption with corresponding molecular absorption spectra allows to constrain 
the variation at other redshifts (Murphy et al. 2001). 
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7.4 Conclusion 

Absorption is a precious tool to observe cold gas at any redshift, although 
detection is favored towards diffuse gas, with low excitation. Much smaller 
masses are detected than in emission. In the Milky Way, a new chemistry has 
been discovered in diffuse clouds, that is nearly as rich as that of dense clouds. 
The evolution with 2 of physical and chemical conditions of the molecular 
component of galaxies can be investigated. Moreover, for intervening galaxies, 
this is the only way to detect cold gas in galaxies that are not ultra-luminous. 

8 Molecules at High Redshift, Perspectives 

8.1 Current State of the Art 

To explore the history of star formation, and its efficiency, it is of first impor- 
tance to estimate the molecular content of galaxies as a function of z. The 
first detection of CO lines at high redshift was done in the hyperluminous 
object IRAS I02I4-I-4724 at z = 2.28 by Brown & Vanden Bout (1992), then 
confirmed by Solomon et al. (1992); it was a big surprise, since an enormous 
H 2 mass was first derived. It has now been revised, knowing that the flux 
is considerably amplified by gravitational lensing (by a factor ~ 50) . It may 
still be surprising to find such huge amounts of CO molecules, especially since 
the gas is expected to have lower metallicity at high z. Theoretical calcula- 
tions have shown however that in a violent and concentrated starburst, the 
metallicity could reach solar values very quickly (Elbaz et al. 1992). 

Today, nearly 20 objects have been detected in CO lines at high z: they are 
often gravitationally amplified, either being multi-imaged by a strong lens. 
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Fig. 48. {Left) Image of the Cloverleaf quasar H 1413-1-117, at z = 2.56, mapped 
in the CO(7 — 6) line with the IRAM interferometer with a 0.6” beam (from Kneib 
et al. 1998). (Right) The quasar BR 1202-07 at z = 4.69, mapped in the dust 
continuum at 1.3mm with the IRAM interferometer. The insets show the CO(5 — 4) 
spectra (from Omont et al. 1996). 
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like the Cloverleaf quasar H 1413+117 at z = 2.558 shown in Fig. 48 (Barvai- 
nis et al. 1994), the lensed radiogalaxy MG 0414+0534 at z = 2.639 (Barvainis 
et al. 1998), or the possibly magnified object BR 1202-0725 at z = 4.69 (Ohta 
et al. 1996, Omont et al. 1996); or they have been intentionally searched for 
behind a foreground galaxy cluster, to benefit of a weak amplification, like the 
submillimeter-selected hyperluminous galaxies SMM 02399-0136 at z = 2.808 
(Frayer et al. 1998), and SMM 02399-0134 at z = 1.062 (Kneib et al. 2002). 
Often several high-J CO lines are detected, revealing the high temperature 
of the gas (typical of a starburst at ~ 60 K). Higher temperatures are rare, 
as in the magnified BAL quasar APM 08279+5255, at z = 3.911, where the 
gas temperature derived from the CO lines is ~ 200 K, maybe excited by 
the quasar (Downes et al. 1999). There are in the list some objects that do 
not reveal any amplification, as the weak radio galaxy 53W 002 (Scoville et 
al. 1997b) at z = 2.394, or the radio-quiet quasar BRI 1335-0417, at z = 4.407 
(Guilloteau et al. 1997). These objects possess the largest molecular contents 
known (~ 10^^ Mq with a standard CO/H 2 conversion ratio), as shown in 
Table 2. The derived molecular masses are so high that H 2 would constitute 
between 30 to 80% of the total dynamical mass (cf 4C 60.07, Papadopoulos 
et al. 2000). The standard conversion ratio might there be reduced by factors 
3 — 4 (Solomon et al. 1997). 

The strategy to detect high-z CO lines is to search for millimetric con- 
tinuum first, since the dust emission is much easier to detect because of the 
negative K-correction. All objects of the Table 2 were first detected in dust 
emission, either by IRAS or in the submillimeter range through the bolome- 
ter arrays SCUBA at JCMT, or MAMBO at IRAM. One advantage of the 
CO molecule is its relatively tight rotational ladder: at high z, the separa- 
tion of the lines is much reduced, and many lines may be observed for the 
same object, in the millimetric range. Since most of the objects detected are 
amplified, it is however difficult to know the true line ratios, and therefore 
the excitation of the gas, since the regions emitting in the various lines may 
have a different spatial extension, and therefore a different lens amplification 
factor. 

What the detections reveal up to now is that the objects are giant star- 
bursts, with hot and dense gas. Only in these conditions, the high-J levels of 
CO can be excited, and therefore the CO be detected at all at high z. Indeed, 
when excited, the high-J lines have a much larger flux than the CO(l — 0) 
line, see Fig. 49. This is a crucial advantage of the molecular tracer, with 
respect to the HI-21 cm emission for instance, which is almost impossible to 
detect beyond z = 0.2. 

The 16 objects detected in CO between 1 < z < 5 are displayed in 
Fig. 49 together with the detection limits of present instruments. It is clearly 
visible that all these objects would not have been detected in CO(l — 0), but 
benefit from the higher detection performance for the high-J lines, as will be 
developed below. One object has been detected in the CO(l — 0) line with the 
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Table 2. CO data for high redshift objects. 



Source 


Z 


GO S AE M Hz Ref 
line mjy km/s 10^° M© 


SMM 02399-0134 


1.062 


2-1 


3 


500 


2* 


1 


0957-b561 


1.414 


2-1 


3 


440 


0.4* 


2 


HR 10 


1.439 


2-1 


4 


400 


7 


3 


F 10214-f4724 


2.285 


3-2 


18 


230 


2* 


4 


53W002 


2.394 


3-2 


3 


540 


7 


5 


H1413-bll7 


2.558 


3-2 


23 


330 


2-6* 


6 


SMM 14011-b0252 2.565 


3-2 


13 


200 


5* 


7 


MG0414-b0534 


2.639 


3-2 


4 


580 


5* 


8 


SMM 02399-0136 


2.808 


3-2 


4 


710 


8* 


9 


6C 1909-f 722 


3.532 


4-3 


2 


530 


4.5 


10 


4C 60.07 


3.791 


4-3 


1.7 


1000 


8 


10 


APM08279-b5255 


3.911 


4-3 


6 


400 


0.3* 


11 


PSS 2322+1944 


4.120 


5-4 


12 


300 


25 


12 


BR 1335-0414 


4.407 


5-4 


7 


420 


10 


13 


BR0952-0115 


4.434 


5-4 


4 


230 


0.3* 


14 


BR 1202-0725 


4.690 


5-4 


8 


320 


10 


15 



* corrected for magnification, when estimated 



Masses have been rescaled to Hq = 75km/s/Mpc. When multiple images are re- 
solved, the flux corresponds to their sum. (1) Kneib et al. (2002); (2) Planesas 
et al. (1999) (3) Andreani et al. (2000); (4) Solomon et al. (1992), Downes et 
al. (1995); (5) Scoville et al. (1997b); (6) Barvainis et al. (1994); (7) Frayer et 
al. (1999); (8) Barvainis et al. (1998); (9) Frayer et al. (1998); (10) Papadopoulos 
et al. (2000); (11) Downes et al. (1999); (12) Cox et al. (2002); (13) Guilloteau et 
al. (1997); (14) Guilloteau et al. (1999); (15) Omont et al. (1996) 



VLA, APM 08279-1-5255, at z = 3.91 (Papadopoulos et al. 2001). It appears 
then that the molecular mass may be 10 — 100 times higher than estimated 
from the higher lines. 



Extremely Red Objects (EROs) : In the aim to search for primeval galax- 
ies, Elston et al. (1988) had identified extremely red objects that are conspic- 
uous only in the near-infrared (R — K > 5). Maybe 10% of the submm sources 
could be EROs (Smail et al. 1999). A proto-typical ERO at z = 1.44 (Dey et 
al. 1999) has been detected in submm continuum (Cimatti et al. 1998), and 
in CO lines (Andreani et al. 2000). 

The SED (Spectral Energy Distribution) for a typical starburst is dis- 
played in Fig. 50 in its rest-frame. Most of the optical flux is obscured and 
re-radiated in the infrared. 

The submillimetric objects detected until now could be evolving in present 
day bright elliptical galaxies, they have the same comoving density, at 1 — 2 L* 
(Frayer et al. 1999), but they certainly are of a different nature and have a 
different history than the local spiral galaxies like the Milky Way. 
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Fig. 49. H 2 masses for the CO-detected objects at high redshift {full dots), com- 
pared to the ultra-luminous-IR sample of Solomon et al. (1997, open pentagons), to 
the Coma supercluster sample from Casoli et al (1996, filled triangles), and to the 
quasar 3C48, marked as a star (Scoville et al. 1993, Wink et al. 1997). The lines 
delineated by various symbols indicate the 1 a detection limit at the IRAM-30 m 
telescope of S{CO) = l.OJykm/s with the 3mm receiver {hyphens), 2.0Jykm/s 
with the 2mm {triangles) and 1.3 mm {crosses) receivers, rms that we can reach in 
6h integration. Note the absence of detected objects between 0.36 and 1 in redshift. 
The points at high 2 ; can be detected well below the sensitivity limit, since they are 
gravitationally amplified. 



8.2 Modelling of a Starburst 

The physical conditions of the molecular gas is only known in local starbursts, 
and the first predictions will be done in assuming the same densities, gas 
concentration and star formation efficiency for the high-redshift starburst 
objects. The active region is generally confined to a compact nuclear disk, 
sub-kpc in size (Scoville et al. 1997a; Solomon et al. 1990, 1997). The gas 
is much denser here than in average over a normal galaxy, of the order of 
10^ cm“^, with clumps at least of 10® cm“® to explain the data on high density 
tracers (HCN, CS...). As already shown before, the interpretation of the 
data requires at least two density and temperature components, and a simple 
model has been computed with two temperatures, at 30 and 90 K (cf. Combes 
et al. 1999). The total molecular mass considered is 6T0^® Mq and the average 
column density A^(H 2 ) of 10^^cm“^, typical of the Orion cloud center. 

Going towards high redshift ( 2 ; > 9), the temperature of the cosmic back- 
ground Tbg becomes of the same order as the interstellar dust temperature, 
and the excitation of the gas by the background radiation competes with 
that of gas collisions. It might then appear easier to detect the lines (Silk 
& Spaans 1997), but this is not the case when every effect is taken into ac- 
count. To have an idea of the increase of the dust temperature with z, the 
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Fig. 50. Spectral modelling applied to one known local starburst (NGC6090). The 
stellar population library is derived from the PEGASE package (Fioc & Rocca- 
Volmerange 1997) for different ages and metallicities. The stellar light is obscured 
according to the extinction law constrained by nearby starbursts by Calzetti et 
al. (2000). The absorbed flux is reemitted in the IR. Relying on Desert et al. (1990), 
this flux is distributed into 3 dust components: big grains, small grains and PAH 
(from Melchior et al. 2001). 



simplest assumption is to consider the same heating power due to the star- 
burst (this comes from the assumed similar geometry for the gas, and similar 
star-forming efficiency) . At steady state, the dust must then radiate the same 
energy in the far-infrared that it receives from the stars and the background. 
The stellar power is thus proportional to the quantity — T®g, if the dust 
is optically thin, and its opacity varies in , with /3 = 2. In these conditions, 
the energy re-radiated by the dust is proportional to while the the 

energy it received from the cosmic background is proportional to T®g. Since 
f3 can also be equal to 1 or 1.5, or the dust be optically thick, the quantity 
to keep constant with z is then which is in some sense the other 

extreme; this in fact does not change fundamentally the results. 
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Simple Two Component Model: The central starburst can be considered 
as the accumulation of very active star-forming complexes, such as Orion, 
typically 8.6 • lO”^ clouds of 700 Mq each. The total mass is 6 • 10^° Mq con- 
centrated in a circum- nuclear region of Ikpc in diameter. The two cloud 
components have both low filling factors. The dense and hot component, cor- 
responding to the star forming cores, has a surface filling factor of /g = 1, 
but a velocity filling factor fy = 0.03, such as the product fsfv "C 1. Each 
cloud is modeled with an individual velocity dispersion of lOkm/s, embed- 
ded in the rotational gradient of the galaxy, 300km/s. The warm and more 
diffuse component has /s = 100, but still fsfv = 3 only, it only begins to 
be macroscopically optically thick. The computed dust temperature for the 
warm component begins to differ from 30 K only at z = 5, and is 34 K, 58 K 
and 85K at 2 = 10, 20 and 30 respectively. 



LVG Approximation: To compute the excitation of the CO molecules, and 
the population of the various J levels, a very simple approximation, justified 
by the turbulence inside the clouds, is to assume that there exists a large 
velocity gradient (LVG) in the medium, to reduce the optical depth, and to 
limit the regions that can absorb the photons to the immediate neighborhood 
of their emission. 

It is in addition verified that most of the clouds, which are individually 
optically thick in the CO lines, are not overlapping on the line of sight, at a 
given velocity, and this is due to the low velocity filling factor. There is only 
mild overlap, that is taken into account in the simple model. For the sake of 
comparison, a model is also considered of a spherical homogeneous medium, 
at T = 50 K, density 10^cm“^, iV(H 2 ) = 3.5 • lO^^cm”^. 



Model Results: With the two-component disk model, the excitation is 
almost LTE (high excitation), as shown in Fig. 51. It is not the case for 
the homogeneous sphere model. The radiative excitation by the background 
temperature at high z is clear in the figure. 

The predictions of the line and continuum intensities as a function of 
redshift and frequencies are plotted in Fig. 52. The CO flux increases with J 
as the frequency to the square power, since the line is optically thick, and in 
the Rayleigh- Jeans domain. But there is no negative K-correction, as in the 
continuum, and it is always more difficult to detect the CO lines at higher 
redshift, although the high- J levels help. In the continuum, as seen on Fig. 52, 
the slope of the curves are of the d**' power, instead of 2 nd, and they overlap 
at high z. 

The model has been computed with lower than solar metallicity, and 
therefore a lower abundance than the standard CO/H 2 = 10“^ (see Fig. 53). 
The flux is first slow to decrease because of the slight macroscopic optical 
depth at the beginning, but then falls rapidly. 
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Lower J level 

Fig. 51. Relative populations of the CO molecules, for the warm cloud component. 
The 8 curves correspond to redshifts z = 0.1, 1, 2, 3, 5, 10, 20, 30. The value of 
Tk indicated is for 2 = 0, and the temperature increases significantly from 2 = 5. 
The full curves are the actual distributions, and the dotted curves correspond to 
the LTE ones, at the corresponding Tk (from Combes et al. 1999). 
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Fig. 52. Expected flux for the two-component (30 and 90 K) cloud model, for var- 
ious redshifts 2 = 0.1, 1, 2, 3, 5, 10, 20, 30, and qo — 0.5. {Left) are the CO lines, 
materialised each by a circle (they are joined by a line only to guide the eye). (Right) 
is the continuum emission from dust. It has been assumed here that — T^g is 
constant with 2 (from Combes et al. 1999). 



Comparison with Continuum: The CO emission does not benefit from 
a negative K-correction, in part because of optical thickness, and it will be 
much more difficult to observe than dust emission, especially for more normal 
galaxies at high redshift, where the high- J lines might not be excited. 

Another point to note on Fig. 52, is that the maximum of CO emission is 
always at longer wavelengths than for the continuum (notice the different A 
scales). The emission peaks at a frequency lower by in average a factor 5 than 
for the continuum. For T = 90K, the dust peaks at ~ 60 ^m, and the CO at 
~ 600 /im. This comes from the fact that the lines always reflect the energy 
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Fig. 53. (Left) Expected flux for the two-component cloud model, as in Fig. 52, 
but with a CO abundance 10 times lower, i.e., CO/H 2 = 10“®. (Right) Same for 
CO/H 2 = 10"®. 



difference between two levels, and only the upper level correspond to the 
temperature of the gas. This is an advantage, given the largest atmospheric 
opacity at high frequencies. 

The increase of the background temperature does not help in detecting 
the CO lines, in spite of the higher degree of excitation. Indeed, molecular gas 
at T = Tbg is always undetectable, and the signal received is always obtained 
by comparison with the off-source position, which is at the T^g temperature. 

Given these ffux predictions, it is now possible to choose the best strategy 
to observe a high -2 galaxy, depends on excitation and redshift. It is not needed 
to go at very high frequency, and lower frequency instruments (cf. Table 3) 
would anyway be complementary to estimate the molecular masses (see the 
table of integration times with various instruments and frequencies in Combes 
et al. 1999). 



Table 3. Panorama of mm and submm instruments. 



Telescope 


Area 


^min 


^min 


IRAM-30 m 


0 

3 


1 mm 


10” 


IRAM-PdB 6 ant 15m=1060m^ 


1 mm 


0.5” 


NRO 


6 ant 10m=509m^ 


1 mm 


0.5” 


OVRO 


6 ant 10 m= 509 m^ 


1 mm 


0.5” 


BIMA 


10 ant 6m=282m^ 


1 mm 


0.5” 


CARMA* 


791 m^ 


1 mm 


0.5” 


SMA 


7 ant 6m=200m^ 


0.3 mm 


0.1” 


GBT 


ant 100m=7854m^ 


2.6 mm 


7” 


In project 


LMT 


ant 50 m=1963 


1 mm 


6” 


ALMA 


64 ant 12 m=7238 m^ 


0.3 mm 


0.1 - 0.01” 


EVLA 


35 ant 25m=17200m^ 


6 mm 


0.004” 



* project, combining OVRO and BIMA antennae 
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8.3 Prediction of Source Counts 

How many sources will be detectable with the future instruments? In the 
frame of the hierarchical theory of galaxy formation, it is possible to esti- 
mate the number of starbursts as a function of 2 ;, such as to be consistent 
with the main observations: star formation history, level of cosmic infrared 
background radiation, and the submm continuum source counts. The cos- 
mology adopted is an Einstein-de Sitter model, 17 = 1, with no cosmological 
constant, and Hq = 75km/s/Mpc, go = 0-5- The number of mergers as a 
function of redshift can be easily computed through the Press-Schechter for- 
malism (Press & Schechter 1974), assuming self-similarity for the probability 
of dark halos merging, and an efficiency of mergers in terms of star-formation 
peaking at z ~ 2 (i.e. Blain & Longair 1993). 

The main parameter use to fit the data is the rate of energy released in a 
merger (or the life-time of the event): this rate must increase strongly with 
redshift (cf. Blain et al. 1999). Once the counts are made compatible with 
the submm observations, the model yields the various relative contributions 
of the different redshifts. Figure 54 shows that the intermediate redshifts 
dominate the continuum source counts (2 < z < 5), if we allow the star 
formation to begin before z = 6. At higher dust temperature, the counts are 
dominated by the highest redshifts (z > 5). 

The same curves can be obtained for the CO lines, as shown at the bottom 
of Fig. 54. The scale now indicates much lower numbers, as expected, and 
they are dominated by the high redshift sources. It is not useful to observe at 
A below 1 mm for high-z protogalaxies, but instead to shift towards A = 1 cm. 
At 2 mm wavelength, the dominant contribution is from 2 < z < 5, and at 
5 mm, the dominant sources are at z > 5. 

8.4 Conclusions 

Already some exceptional objects have been detected in the CO lines at high 
redshift, and the new generation of mm instruments will make a breakthrough 
in this domain, in detecting a few objects per square degree. The fraction of 
gas mass in galaxies is expected to increase with redshift as well as the fraction 
of starburst and their efficiency (shorter dynamical time) . 

The CO lines bring much more information than the mm continuum, in 
giving the mass of H 2 in the galaxy, the kinematics, the efficiency of star 
formation as a function of redshift. They will be used to find redshifts from 
the continuum-detected objects, when those are too obscured to be detected 
in the optical or NIR. In any case, they give complementary and unbiased 
information than in the optical, when the line width does not reflect the total 
mass, because of outflows and heavy extinction. 
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Fig. 54. {Top left) History of the star formation rate in the model (curve), com- 
pared with observations. The point at 2 = 0 is from Gallego et al. (1995), the 3 
following ones (full circles) from Flores et al. (1999), the empty circle from Yan et 
al. (1999), and the two high redshift points (filled squares), from Pettini et al. (1998). 
(Top right) Model cnrve for the far-infrared cosmic background. The filled triangles 
with a line joining them are derived by Puget et al. (1996) from GOBE, the filled 
circles are from Hauser et al. (1998), and the filled square from Dwek et al. (1998). 
(Middle left and right) Source counts predicted at 850 /im and 2 mm respectively: 
the solid line is the total. Dash is the lowest redshifts (2 < 2); Dot-dash, interme- 
diate (2 < 2 < 5); Dots are the highest redshifts (2 > 5). The emission from the 
dusty starbursts have been assumed a grey-body at a temperature of 35 K, and an 
opacity varying as The empty circles are data from Blain et al. (2000), and the 
straight line is a fit derived by Barger et al. (1999) at 850 pm. (Bottom left) Source 
counts for the GO lines at an observed frequency of 2 mm, assuming optically thick 
gas at Tex = 35 K. (Bottom right) Same for A = 5mm (from Gombes 2000). 

9 History of the Molecular Component 

9.1 Formation of H 2 

The formation of H2 molecules is fundamental in the early universe. H2 is 
the main coolant of the primordial gas, to allow the collapse of structures 
(taken over afterward by HD and LiH). The H2 molecule may be formed in 
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gas-phase reactions, involving H and/or H“'". It cannot be formed directly 
by radiative association 

H-bH^H2-b7, (56) 

since the molecule has no dipole (on the contrary HD can be formed through 
radiative association). At very high densities (larger than 10®cm“^), the 
three-body reaction can also take place. It is essential that a third body 
takes away the formation energy of the molecule. 

The formation mechanism is very different at 2 : = 0, the dust acts as a 
catalyst, and helps to form H 2 much quicker. As soon as dust is present, and 
ions have decreased, H 2 is formed preferentially on dust. 

The H 2 formation on grains may occur according to two processes (e.g. 
Herbst, 2000): 

• The Langmuir-Hinshelwood process, in which H atoms land on a grain 
and diffuse on the surface, by either tunneling, or hopping. 

• The Eley-Rideal process, where H atoms are fixed in position, chemisorbed, 
and a reaction occurs only when another H lands atop. 

The number of binding sites for adsorbers is of the order of A^ = 10® on 
a 0.1 fxm grain. The binding energy, or energy required for desorption (Eu) 
varies, according to the composition of the grain, it is for H on olivine of 
E]j = 372 K (Katz et al. 1999), and H on amorphous carbon Ed = 658 K; 
for H 2 respectively 314 and 542 K (Katz et al. 1999). 

Since the desorption energy is larger for heavier molecules, only H 2 can 
be formed like that, at 10 K. The grain, as a catalyst, absorbs the energy of 
formation, that can help to desorb another molecule. 

At low temperature, H-atoms don’t diffuse on the grain, and the sec- 
ond (Eley-Rideal) process is the only one. Recent experimental results have 
shown that the rate of diffusion of H-atoms on interstellar-like surfaces is 
significantly lower than estimated in the past. 

The formation of H 2 on grain surfaces is still poorly understood, because 
the details characteristics of the surfaces are poorly constrained and time- 
dependent (Herbst 2000). 

The interaction of H-atoms with the two kinds of interstellar grains, car- 
bonaceous and silicate, have been studied theoretically (although more for 
the H-graphite interaction). The first kind include the PAH- like grains, for 
instance coronene (C24H12). For the second kind, the formation of H“ on 
MgO or forsterite (Mg 2 Si 04 ) by charge transfer would enable the formation 
of H 2 via associative detachment (Sidis et al. 2000). Experimental formation 
of H 2 on amorphous carbon, or olivine, in the conditions of cold temperature 
5 — 20 K to simulate interstellar conditions, have also given some insight in 
the mobility of H on grains (Pirronello et al. 1999). 



Formation in Gas Phase Reactions: Lepp & Shull (1984) have computed 
the formation of the interesting molecules H 2 , HD, LiH and HeH+ during the 
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post-recombination epoch (z = 300 — 30). In standard nucleosynthesis, there 
exist only H, D, ^He, ^He, ^Li, and molecules have to be formed without 
dust. The molecules dominate the cooling and are responsible for the first 
cloud collapse and star formation. 

Cooling by Lyman-a is ineffective below T = 8000 K, and therefore after 
2: = 500. H2 is formed in small quantities from H“, and provides sufficient 
cooling to trigger the cloud collapse. Cooling is necessary since a Jeans mass 
will increase in adiabatic collapse, and there would be no fragmentation. 

The two well known processes that can form H2 in gas-phase reactions, 
at low densities, are the H“ and the HJ processes. The first process (cf rates 
by Bieniek & Dalgarno 1979) is: 

H-ke-^H--k 7, (57) 

H--kH^H 2 -ke" . (58) 

The second process (cf. Karpas et al. 1979) is: 

H -k H+ H+ -k 7 (59) 

H+ -k H ^ H2 -k H+ . (60) 

At high density, when the clouds are already collapsed, a third process inter- 
venes (e.g. Palla et al. 1983), the three-body process, for n > 10®cm“^: 

3H^H2-kH. (61) 



Results of Chemistry: Lepp and Shull (1984) made their computations 
in a standard Friedman model (i.e. A = 0), and Qq = = 0.1 (with only 

baryonic dark matter). The resulting evolution of fractional abundances are 
plotted in Fig. 55, with all channels for H2 formation indicated. Once Comp- 
ton heating becomes less than expansion cooling, radiation and matter cool 
adiabatically: 

Tr - (1 -k z) and T„, - (1 -k z)^ . (62) 

H2 reaches an abundance of 10“® in the diffuse medium. As for cooling rates 
per H2 molecule, the LiH dominates, for HD/H2 = lO”"* and LiH/H2 = 10“® 
(but since then, the abundance of the LiH molecule is thought to be much 
less, due to revised radiative association formation rate of LiH, Dalgarno et 
al. 1996). 

The cooling is enough for cloud to collapse, without being stopped by 
adiabatic regime. When a self-similar free-fall collapse is considered, at z = 
50, with the Jeans length Aj = 560 pc, and corresponding mass Mj = 10® Mq, 
much more molecules are formed. The most efficient cooling is by H2, then 
HD, then LiH. The cooling rotational lines become optically thick at: 

N = 10^^cm“^Z\t; for H2, 
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Fig. 55. Abundance fractions of atoms, ions and molecules versus redshift in a 
cosmological model where O = fib = 0.1, A = 0 and h = 0.5. Various H 2 formation 
channels are indicated by intermediate species in parentheses (from Lepp & Shull, 
1984). 



N = 10^° cm '^Av for HD, 

N = for LiH, 

where Z\ v in km/s is the velocity width of the line induced by cloud collapse, 
estimated at ~ 100 by Lepp & Shull (1984). H 2 becomes optically thick at 
the same point as 3-body reaction sets in, then the H 2 formation by 3-body 
reaction begins to yield a significant heating. LiH remains thin and effective 
coolant until higher densities of LiH/H= 10“^*^. 

9.2 Chemistry of the Early Universe 

A more detailed model, including 87 gas phase reactions with H, D, He, 
Li chemistry, has been devised by Galli & Palla (1998), who also compare 
their results with those of the various models in the literature (Lepp & Shull 
1984; Black 1991; Puy et al. 1993; Palla et al. 1995; Giroux & Shapiro 1996). 
They take into account the smaller radiative association for LiH, which was 
overestimated by a factor 100 (Stancil et al. 1996), and also non equilibrium 
chemistry in radiative shocks during collapse, better H recombination, 2-3 less 
electrons, etc. They pointed out the minimal network of chemical reactions, 
which can reproduce the more complex network, it includes 13 reactions 
involving H and He, 6 for deuterium, and 14 for Li (Galli & Palla 1998). 

At 2 ; = 1, they find e“/H = 3- 10“^, H 2 /H ~ 10“®, in the diffuse medium. 
They improved the cooling functions of H 2 for H— H 2 collisions, as a function 
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of density. The cooling function per molecule of H 2 , HD, LiH and Hj in 
the low density limit (n < 100 cm“^) is plotted in Fig. 56. They also show 
the influence of the cosmological parameters: H 2 is almost independent of 
cosmology, while Li is more variable, LiH and HD can vary by 2 — 3 orders 
of magnitude. 

9.3 First Structures in the Dark Age 

In the current best cosmological models, the first structures to form are the 
smallest one (bottom-up scenario); the structures form from small density 
fluctuations, that first expand with the Universe while growing linearly, then 
collapse non-linearly to bound systems. These fluctuations could be adiabatic, 
in which case they would be damped on scales below 3 • Mq 

before recombination (Silk 1968; where h is the Hubble constant in units 
of 100 kms“^ Mpc“^). Isothermal fluctuations can also exist (and, of course, 
also intermediate states), and in such cases, the first non-linear objects are 
much smaller. 

The growth of adiabatic fluctuations at a scale of lO^"* Mq (8 Mpc) is 
described schematically in Fig. 57. Radiation fluctuations grow until their 




Fig. 56. Cooling function per molecule of H 2 , HD, LiH and Hj in the low density 
limit n(H) < 100 cm“® (from Galli & Palla, 1998). 
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Fig. 57. The growth of adiabatic fluctuations at a scale of 10^^ M© (8Mpc). The 
fluctuations of radiation {solid line) initially grow up to the point when the mass 
contained within the horizon (a sphere of radius ct) is equal to their own mass. 
They remain constant thereafter and their amplitude can be fitted so as to be 
compatible with current observations (arrow). The fluctuations of matter in the 
standard model (dots) follow the radiation until recombination (R). After they 
can grow, but too slowly to reach the amplitude observed (hatched rectangle). The 
fluctuations of non-baryonic dark matter also follow the radiation, but until the 
end of radiative era (E). The amplitude of the dark-matter fluctuations is therefore 
larger, and the baryons which are in this model only 10% of the total mass, fall 
in the dark potential wells. In the massive-neutrino scenario the fluctuations are 
destroyed by dissipative effects (dot- dash) (from Turner 1984). 



mass equal that contained in the horizon ct] they remain constant after, 
and are calibrated with their present amplitude. The matter fluctuations 
follow radiation, if ionised. After recombination they grow too slowly in the 
standard model without non-baryonic matter, to reach the observed level 
today. Fluctuations of non-baryonic matter subject only to gravity, can begin 
to grow earlier, when the matter gravity dominates radiation. They trap 
baryons in their potential well, and the present amplitude of structures can 
be explained. 

The power spectrum of the fluctuations, as a function of wave number k 
is assumed self-similar, as a power law n: 



\5k\'^ = P(k) ~ A:” 



(63) 
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with n = 1 on large-scales (as confirmed by COBE), and becoming n = —3 
on small scales, after a spectrum tilt, when the matter density equals the 
radiation density pr ~ Pm at the horizon scale (Peebles 1982). The density 
fluctuations can then be deduced: 

^ (X M-("+3)/6 (64) 

P 

then the largest masses to become non-linear after recombination, i.e., at 
z ~ 1500, are of the order of M ~ 10®“® Mq, depending on Q, h and n, if 
the spectrum is calibrated from the presently observed correlation function 
(e.g., Peebles 1980). That means that all masses smaller than M ~ 10®“® Mq, 
and larger than the Jeans mass, which is M ~ 10® Mq(j^)“^/^(p^)“^ at 
z = 1000, will collapse, become bound, and decouple from expansion (at 
this epoch the Jeans length is of the order of 2 pc (6^)~^^^(6^)~^)- In fact, 
since in every hierarchical model with n > — 3 the smallest masses have more 
density contrast, structures at precisely the Jeans mass collapse first. The 
masses correspond to typical z = 0 giant molecular clouds. 



Nature of the Collapse: The nature of the collapse has been studied by 
many authors. If cooling is efficient enough (tcooI ~ Rj), the collapse is quasi- 
isothermal, and fragmentation occurs, since the Jeans mass becomes smaller 
and smaller as the density increases (e.g., Hoyle 1953). This means that 
pressure forces are at most equal to gravity forces, and cannot stop the col- 
lapse. This fragmentation process occurs currently in the Galactic interstel- 
lar medium in a self-similar hierarchical structure (Larson 1981; Scalo 1985), 
and might even be pursued down to very low masses in particularly cold and 
quiet regions, such as the outer parts of galaxies (Pfenniger & Combes 1994). 
Fragmentation is limited by opacity, and the smallest fragments (or clum- 
puscules) , which are at the transition of being pressure supported, are today 
of the order of 1O“®M0, and their mass grows slowly, as or (1 -|- z)^f^, 
with redshift. This assumes a quasi-isothermal regime for the bottom of the 
hierarchy (containing most of the mass); therefore it is important that gas 
cooling remains efficient. 

For gas just after recombination, above 10'^ K the main coolant is atomic 
hydrogen (by collisional excitation of Lyman-a) , and below this temperature 
the vibration-rotation lines of molecular hydrogen take over until T = 200 K, 
and we have seen in the previous section that significant quantity of H2 
molecules forms through H“ and H^. Below 200 K, HD is then more efficient 
(Palla & Zinnecker 1987; Puy & Signore 1996). Many groups have tackled 
the problem of computing the physico-chemistry of the primordial gas, to 
determine the size of the first forming bound structures (Yoneyama 1972; 
Hutchins 1976; Carlberg 1981; Palla et al. 1983; Lepp & Shull 1984). All of 
them have found that the cooling is indeed efficient, as soon as the redshift 
is below z ~ 200, and the calculated masses of fragments are in the wide 
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range 0.1 — 100 Mq. Even if clouds of just the Jeans mass see their collapse 
somewhat delayed because of pressure forces, the latter are negligible for 
clouds of M > 10® Mq (Lahav 1986). 

Most studies were interested in the first generation of stars. Already Pee- 
bles & Dicke (1968) concluded that the first objects to form were the globular 
clusters. Palla et al. (1983) in forming H 2 through the very efficient 3-body 
process, concluded that all the primordial gas is converted in H 2 at a density 
of 10^^cm“®, and that the Jeans mass eventually falls below 0.1 Mq, allow- 
ing the whole mass spectrum presently observed, for the first forming stars. 
Rotation could hinder the collapse (Kashlinsky & Rees 1983), and this could 
lead to Population III low-mass stars as well as super-massive objects. 

Observations of the present ISM show that the efficiency of star formation 
is very low, and it is very likely that the bulk of the gas in this first collapse of 
GMC-size primordial clouds after recombination and decoupling form clum- 
puscules (Pfenniger & Combes 1994). Most of the fragments remain pressure 
supported, as soon as optically thick, and form a fractal of molecular clouds. 
All the hierarchical structure remains at the background temperature. 



9.4 H 2 Formation and Cooling 



To test this possibility, the H 2 formation and gas cooling has been computed, 
in the hypothesis of fractal formation (Combes & Pfenniger 1998). The simple 
scheme of chemistry reactions summarised by Tegmark et al. (1997), including 
the H“, the and 3-H processes, was used. The cooling is essentially due 
to H 2 below lOOOK, then HD below lOOK. 

The density is computed from the simple top-hat model before viriali- 
sation. Then assuming efficient cooling, fragmentation occurs, as a fractal 
structure, of dimension D = 1.7. The number of fragments at each scale is 
N = 8, compatible with the fractal of the interstellar medium in the Milky 
Way today (Scalo, 1985). The scale ratio between two levels of the hierarchy 
is then = 3.4. The densities are in the ratio = 4.9. In 

the hypothesis of efficient cooling, the density as a function of time can be 
computed by steps corresponding to the free-fall time Tff,i of a given level i 
of the hierarchy: n{t) = Fragmentation stops when the clumps 

become opaque to the H 2 rotational IR lines, i.e., when n ~ 10^®cm“®. 

The cooling is efficient, if all the energy of the collapse is radiated away, 
and the temperature of the gas in the collapsing structures never becomes 
higher than the virial temperature: 



T 

J- V 



1420 K 



/ M 
Vio^Mq 



2/3 



(1 -I- Zvir) 
100 



(65) 



where Zvir is the redshift at which the giant cloud virialises. Some results of 
the computations are displayed in Fig. 58, for Zvir = 100. The horizontal full 
line indicates the virial temperature of the Jeans mass at recombination T„jr- 
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Fig. 58. Evolution of temperature (Te in 10® K), molecular fraction (/h 2 ), ioniza- 
tion fraction (xe) and density (ns in 10® cm“®) for a gas cloud after recombination, 
collapsing at Zvu = 100. The Hubble constant is h — 0.5, 17 = 1 and 17b = 0.06 
(from Combes & Pfenniger, 1998). 



The density increases by steps (corresponding to each scale of the recursive 
fragmentation), their temperature stays always below Tyir, and the clumps 
are completely molecular at the end. 

9.5 What Are the First Structures? 

This old question has received many different answers, according to assump- 
tions on the IMF and efficiency of star formation. The first objects could be 
gas clumps, brown dwarfs, 0.5 Mq MACHOs, standard IMF stellar systems, 
or massive black holes. Numerically, often symmetries are assumed (Boden- 
heimer 1986) with hydrodynamics, chemistry, but following the fragmenta- 
tion is difficult. There is the problem of the high dynamical range (8 — 15 
orders of magnitude), of non-equilibrium chemistry, non-linear dynamics, and 
in particular problems when the structures become optically thick. 

The highest resolution simulations have been carried out recently, with 
adaptive mesh refinement, to form the first star (see Fig. 59, Abel et al. 2000). 
While the virial radius of 5.6 • IO^Mq halo is 100 pc, the cell size is 0.024 pc 
and the simulation box 6.4 kpc. In a hierarchical scenario, dense gas blobs 
form at filaments crossing (Abel et al. 1997, 1998). 
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J=19.1 




Fig. 59. Density and temperature in the first cosmological objects expected to form 
in hierarchical structure formation scenarios. The upper panels show the log of the 
baryonic overdensity in a slice through the point of highest gas density at a scale of 
320 pc {left) and 32 pc {right). The lower panels give the corresponding plots of the 
log of the gas temperature. Additionally, the velocity field is also visualized in the 
upper left panel. Note that the computational volume simulated is 20 times larger 
than the left panels (from Abel et al. 2000). 



The simulation reproduces a low star-forming efficiency: less than 1% of 
the gas will form stars. The first stars or quasars are likely to appear in 
clumps of Tvir 10^ K at z ^ 20, and H 2 cooling plays a major role. 

The direct H 2 radiation will not be detectable, but as soon as there is star 
formation with larger than 1% efficiency, or quasar radiating at Eddington 




Molecules in Galaxies at All Redshifts 



201 



limit, it will be detectable with NGST at 2 ; = 15 (Haiman & Loeb 1998). In 
case of mini-quasars, there will be reionisation by z ^ 10. 

The average star formation efficiency can be probed through the enrich- 
ment of the Lyman-a forest. It is not larger than 2% in collapsed regions, 
not far from the 1% deduced from numerical simulations. But this deduction 
is based on the hypothesis of normal IMF, which is very uncertain. 



Efficiency of Star Formation: Observations show that the efficiency of 
star formation is highly variable, and can be very low in some cases (e.g., in 
low surface brightness galaxies). Since the physical conditions of the gas at 
z ~ 10 — 100 is similar to the present outer parts of galaxies, we expect that 
the conditions for triggering global star formation are generally not met. The 
reasons for low star-forming efficiency are then: 

• No deep potential wells, since galaxy-size structures have not yet virial- 
ized 

• No critical surface density for star formation 

• Collapsing mass always of the order of the Jeans mass, and not far from 
pressure support, since the Jeans mass decreases gradually as fragmen- 
tation proceeds. 

When objects of galaxy-size collapse, M ^ Mj, then violent instabilities, 
and starburst can occur, unless rotation stabilises the perturbations. 

Therefore, only sporadic star-formation (or even MACHOs formation) 
should occur at high redshift (z > 10 — 100), and that might be sufficient 
to initiate the reionization and the re-heating of the intergalactic medium 
(IGM), as is necessary to explain the high ionization fraction of the Lyman- 
a absorbers (Tegmark et al. 1984) and the omnipresent Hell gas at z ~ 2 
(Jacobsen et al. 1994; Davidsen et al. 1996; Reimers et al. 1997). The first 
generation of stars could form today a small percentage of the dark halos 
around galaxies under the form of white or brown dwarfs (Alcock et al. 1997). 

Once the bulk of the gas mass has condensed into molecular gas at z ^ 
150, the clumps survive the reionization because of their high column density 
(> 10^^cm“^), and their fractal structure (Combes & Pfenniger 1997). At 
the interface with the extra-galactic background radiation, they will have an 
ionised and an atomic envelope. If the background comes essentially from 
the quasar UV light, and provides an ionization rate of ^ ~ 2 • 10“^^ s“^, 
corresponding to the study of low-redshift Lyman-a absorption lines (Madau 
1992), then the column density at the interface is Nhi ~ 10^®cm“^, as is 
observed in the outskirts of HI disks (Corbelli & Salpeter 1993). 



Galaxy Formation: In this picture, a main peak of star formation will 
occur at galaxy formation, which could range between z = 50 and z = 0 (at 
z = 200, normal lOkpc galaxies would overlap). These galaxies would form 
through the merging of smaller entities. The protogalaxy collapse is violent 
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{M ^ Mj), and starburst may occur in their centers. In the outer parts, 
there can remain cold self-gravitating gas, as a reservoir for quiescent star 
formation, or new starburst in hierarchical merging. 



Cluster Formation: Another peak of star-formation is expected to occur 
when larger structures begin to collapse at z = 2, and galaxies merge in large 
quantities. Outer gaseous haloes are stripped, heated, and contribute to form 
the hot IGM, at T^ir of the galaxy cluster. A multiphase medium may survive 
(e.g. Ferland et al. 1994), in particular accounting for the cooling gas at the 
center (David et al. 1995). 



Evolution of Gas Content: The evolution of the gas in galaxies can be 
traced through the damped Lyman-a absorbing systems (DLA), that are 
thought to come from external parts of galaxies. 

The number of DLA traced by Mgll between z = 0 and 1.65, then by 
Lyman-a are plotted in Fig. 60. They follow the power-law distribution: 

n{DLA){z)o^{l + zf■‘^'^ (66) 

which is a strong evolution. However this result depends strongly on a few rare 
systems at the present time. Figure 60 shows that the gas density observed 
at z = 3 corresponds to the luminous density observed today (square in the 
figure) . 

9.6 Conclusion 

After recombination, GMCs of 10®“® Mq collapse and fragment down to 
1O“®M0, due to efficient H 2 cooling. The bulk of the gas might not form 
stars but a fractal structure, in statistical equilibrium with the cosmic back- 
ground temperature. Only a sporadic, low level, star formation is expected. 

After the first stars (and the first quasars?) have formed, reheating and 
reionization of the universe begins. The cold gas survives and will be assem- 
bled in larger scale structures to form galaxies. 

The fact that the bulk of the baryons condense without forming stars is a 
way to solve the cooling catastrophe (Blanchard et al. 1992). The H 2 fractal 
structure regulates the consumption of gas into stars, and play the role of a 
reservoir for later starbursts, and long-term quiescent star formation. 
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Dust plays an increasingly important role in astrophysics. Historically, 
dust was first recognized for its obscuring effects, and the need to correct 
observed intensities for attenuation by dust continues today. But with the 
increasing sensitivity of IR, FIR, and submm telescopes, dust is increasingly 
important as a diagnostic, with its emission spectrum providing an indica- 
tor of physical conditions, and its radiated power bearing witness to star 
populations of which we might otherwise be unaware. Finally, and most fun- 
damentally, dust is now understood to play many critical roles in galactic 
evolution. By sequestering selected elements in the solid grains, and by cat- 
alyzing formation of the H 2 molecule, dust grains are central to the chemistry 
of interstellar gas. Photoelectrons from dust grains can dominate the heating 
of gas in regions where ultraviolet starlight is present, and in dense regions the 
infrared emission from dust can be an important cooling mechanism. Finally, 
dust grains can be important in interstellar gas dynamics, communicating 
radiation pressure from starlight to the gas, and providing coupling of the 
magnetic field to the gas in regions of low fractional ionization. 

We would like to understand these effects of dust in the Milky Way, in 
other galaxies, and as a function of cosmic time. These lectures are organized 
around topics in the astrophysics of dust in the the Milky Way, as this is our 
best guide to understanding and modelling dust long ago and far away. 

1 Introduction to Interstellar Dust 

We begin with a brief review of some of the observational evidence which 
informs our study of interstellar dust. 

1.1 Interstellar Extinction 

Through study of open star clusters in the Galaxy, Trumpler (1930) found 
that distant stars were dimmed by something in addition to the inverse square 
law, and concluded that interstellar space in the galactic plane contained “fine 
cosmic dust particles of various sizes . . . producing the observed selective ab- 
sorption” . Over the past 7 decades we have built on Trumpler’s pioneering 
study, but many aspects of interstellar dust - including its chemical compo- 
sition! - remain uncertain. Let us therefore begin by reviewing the different 
ways in which nature permits us to study interstellar dust. 
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Trumpler analyzed the interaction of light with interstellar dust, and this 
remains our most direct way to study interstellar dust. We use stars as “stan- 
dard candles”, and study the “selective extinction” - or “reddening” - of 
starlight by the dust. With the assumption that the extinction (= absorption 
-I- scattering) goes to zero at wavelengths A — >■ oo, and observing the star at 
sufficiently long wavelength (to, in effect, determine its distance) one can de- 
termine the attenuation of the starlight by dust as a function of wavelength. 
Because atomic hydrogen absorbs strongly for hv > 13.6 eV, it is only pos- 
sible to measure the contribution of dust to the extinction at hv < 13.6 eV, 
or A > 912 A. A typical “extinction curve” - the extinction as a function of 
wavelength or frequency — is shown in Fig. 1, showing the rapid rise in ex- 
tinction in the vacuum ultraviolet. Observed extinction curves vary in shape 
from one line-of-sight to another, but appear to approximately form a one- 
parameter family (Cardelli et al. 1989); the parameter is often taken to be the 
ratio Rv = {Ab — Ay)! Ay, where Ab and Ay are the extinctions measured 
in the B (4400A) and V (5500A) spectral bands. A parametrization of the 
extinction curve was provided by Cardelli et ah; the curves in Fig. 1 were 




Fig. 1. Extinction at wavelength A, relative to the extinction at 7 = 900 nm, 
as a function of inverse wavelength A~^, for Milky Way regions characterized by 
different values of Ry = Ay j E{B — V), where Ab is the extinction at B — 4400 fim, 
Ay is the extinction at V — 5500 /rm, and the “reddening” E{B — V) = Ab — 
Ay. Note the rapid rise in extinction in the vacuum ultraviolet (A 0.2 pm) for 
regions with Ry ^ 4. The normalization per H nucleon is approximately Aj/Nn « 
2.6 X 10“^^ cm^/H. The silicate absorption feature (§1.5) at 9.7 pm and the diffuse 
interstellar bands (§1.6) are barely visible. 
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calculated using a more recent prescription by Fitzpatrick (1999), with the 
extinction in the infrared following Draine (1989b). 

We will discuss dust grain optics below, but it is clear that if the dust 
grains were large compared to the wavelength, we would be in the “geometric 
optics” limit and the extinction cross-section would be independent of wave- 
length. Therefore the tendency for the extinction to rise even at the shortest 
wavelengths where we can measure it tells us that grains smaller than the 
wavelength must be making an appreciable contribution to the extinction at 
all of observed wavelengths. As we will see below, “small” means (approxi- 
mately) that 27ra|m — 1|/A ^ 1, where m(A) is the complex refractive index. 
Thus if |to — 1| « 1 at A = 0.1 y^m, 

• we must have large numbers of grains with a ^ 0.015 /rm. 



1.2 Scattering of Starlight by Dust Grains 

When an interstellar cloud happens to be unusually near one or more bright 
stars, we have a “reflection nebula” , where we see the starlight photons which 
have been scattered by the dust in the cloud. The spectrum of the light 
coming from the cloud surface shows the stellar absorption lines, showing that 
scattering rather than some emission process is responsible. By comparing 
the observed scattered intensity with the estimated intensity of the starlight 
incident on the cloud, it is possible to infer the albedo of the dust - the ratio 
of scattering cross section to extinction cross section. The result is that in 
the optical the interstellar dust mixture has an albedo u> « 0.5 - scattering 
is about as important as absorption - and the grains are somewhat forward 
scattering, with (cos 9) « 0.5. Rayleigh scattering by particles small compared 
to the wavelength has (cos 6) « 0, so this tells us that 

• the particles dominating the scattering at A « 0.6 /rm 
have a ^ X/2 t: « 0.1 /xm. 



1.3 Polarization of Starlight 

The polarization of starlight was discovered in 1949 (Hall 1949; Hall & Mike- 
sell 1949; Hiltner 1949a,b). When it was realized that the degree of polariza- 
tion tended to be larger for stars with greater reddening, and that stars in a 
given region of the sky tended to have similar polarization directions, it was 
obvious that the polarization is produced by the interstellar medium: light 
propagating through the interstellar medium becomes linearly polarized as a 
result of preferential extinction of one linear polarization mode relative to the 
other. The polarization percentage typically peaks near the V band (5500A), 
and can be empirically described by the “Serkowski law” (Serkowski 1973): 

p(A) «p(A 

max ) exp[— ATln^(A/A 

max)] 1 



( 1 ) 
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with Amax ~ 5500 A and K « 1.15. This “linear dichroism” of the interstellar 
medium is due to dust grains which are partially aligned by the interstellar 
magnetic field. The peak polarization Pmax is found to fall within an envelope 
0 < Pmax < 0.09(if(i? — y)/mag), or 0 < py A 0.03 ry; the maximum values 
are presumed to arise on sightlines where the magnetic field is uniform and 
perpendicular to the line-of-sight. While the “Serkowski law” was put forward 
as a fit to the observed polarization at 0.3 pm A. A A 1 pm, it turns out to give 
a surprisingly good approximation to the measured linear polarization in the 
vacuum ultraviolet (Clayton et al 1992, Wolff et al 1997) as seen in Fig. 2. 

The mechanism responsible for the grain alignment remains a fascinat- 
ing puzzle, which we will discuss below. Independent of the grain alignment 
mechanism, however, we can infer the sizes of the interstellar grains respon- 
sible for this polarization by noting that the extinction rises rapidly into the 
UV whereas the polarization drops (Kim & Martin 1995). This can be un- 
derstood if the grains responsible for the polarization have sizes a such that 
a « (Amax/27r) « 0.1 pm: then as one proceeds into the UV one moves toward 




Fig. 2. Linear polarization in the nltraviolet measured by Clayton et al. (1992). 
The solid line is the “Serkowski law” fit to the data. Figure from Clayton et al. 
(1992). 
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the “geometric optics” limit where both polarization modes suffer the same 
extinction, so the polarization goes to zero: 

• The extinction at A « 0.55 /xm has an appreciable contribution from 
grains with sizes a « 0.1 /xm which are non-spherical and substantially 
aligned. 

• The grains with a ^ 0.05 /xm which dominate the extinction at A 0.3 /xm 
are either spherical (which seems unlikely) or minimally aligned. 



1.4 Spectroscopy of Dust: The 2175A Feature 

Of what is interstellar dust composed? One may look for spectroscopic clues in 
the extinction. The extinction curves in Fig. 1 show a conspicuous extinction 
feature at A“^ = 4.6 /xm“^, or A = 2175A. The feature is well-described by 
a Drude profile. The central wavelength is nearly identical on all sightlines, 
but the width varies significantly from one region to another (Fitzpatrick & 
Massa 1986). 

The strength of this feature implies that the responsible material must be 
abundant (Draine 1989a): it must be made from H, C, N, O, Mg, Si, S, or 
Fe. Small graphite grains would have a strong absorption peak at about this 
frequency (Stecher & Bonn 1965; Draine 1989a), due to tt — >■ tt* electronic 
excitations in the sp^-bonded carbon sheets. Since the carbon skeleton of 
polycyclic aromatic hydrocarbon (PAH) molecules resembles a portion of a 
graphite sheet, such molecules also tend to have strong electronic transitions 
at about this frequency. It therefore seems likely that the 2 175 A feature is 
due to some form of sp^-bonded carbon material. 

1.5 Spectroscopy of Dust: The Silicate Features 

There is a conspicuous infrared absorption feature at 9.7 pm, shown in Fig. 3. 
Silicate minerals generally have strong absorption resonances due to the Si-0 
stretching mode near 10 pm, and it seems virtually certain that the interstel- 
lar 9.7 pm feature is due to silicates. This conclusion is strengthened by the 
fact that the 10 pm emission feature is seen in the outflows from oxygen-rich 
stars (which would be expected to condense silicate dust) but not in the out- 
flows from carbon-rich stars. The interstellar 9.7 pm feature is seen both in 
emission (e.g., in the Trapezium region in Orion [Gillett, Forrest, et al. 1975]) 
or in extinction in the interstellar medium (Roche & Aitken 1984). 

Crystalline silicate minerals generally have sharp features in their 10 pm 
absorption which are not seen in the broad interstellar 10 pm feature, leading 
to the conclusion that interstellar silicates are probably amorphous. 

Near 18 pm warm interstellar dust shows another emission feature, which 
is attributable to the Si-O-Si bending mode in amorphous silicates. 
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Fig. 3. Infrared extinction, relative to extinction at 7 = 900 nm, showing the strong 
9.7 /rm and 18 /J.m silicate features, the 3.4 pm aliphatic C-H stretch, and a weak 
unidentified DIB at 1.32pm (Joblin et al. 1990). 




Fig. 4. Extinction at wavelength A, relative to the extinction at 7 = 900 nm, show- 
ing some of the diffuse interstellar bands. 



1.6 Spectroscopy of Dust: Diffuse Interstellar Bands 

The 3 features at 0.22 pm, 9.7 pm, and 18 pm are by far the strongest fea- 
tures seen in diffuse interstellar dust. There are, in addition, numerous weaker 
features in the optical known as the “diffuse interstellar bands”, or DIBs. 
Figure 4 shows the extinction for 1.5 pm“^ < < 1.75 pm“^, with sev- 

eral conspicuous DIBs present, most notably the DIB at 0.5780 pm. The 
strongest DIB falls at 443. Onm. Jenniskens & Desert (1994) report a total of 
154 “certain” DIBs in the interval 0.38 — 0.868 pm, plus another 52 “prob- 
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Wavelength (A) 

Fig. 5. Fine structure of the 5797A DIB toward ( Oph, ^ Per, and /x Sgr (from 
Kerr et al. 1998). Note in particular the very narrow feature labelled 0. 



able” detections. DIBs were discovered 80 years ago (Heger 1922) and their 
interstellar nature was established 68 years ago (Merrill 1934). 

It is embarrassing that Nature has provided astrophysicists with this 
wealth of spectroscopic clues, yet as of this writing not a single one of the 
DIBs has been convincingly identified! It seems likely that some of the DIBs 
may be due to free-flying large molecules; this hypothesis has received sup- 
port from high resolution spectra of the 5797A feature (see Fig. 5) showing 
intrinsic ultra-fine structure (Sarre et al. 1995; Kerr et al. 1998). 



1.7 Spectroscopy of Dust: The 3.4pm Feature 

There is a broad absorption feature at 3.4 pm which is almost certainly due 
to the C-H stretching mode in hydrocarbons. A recent study by Pendleton 
& Allamandola (2002) concluded that hydrocarbons with a mixed aromatic 
(ring) and aliphatic (chain) character provided a good fit to the observed 
interstellar absorption, including the 3.35 — 3.53 pm region. This included 
hydrocarbon Aims deposited following laser-ablation of amorphous carbon in 
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Ar, followed by exposure to atomic H (Mennella et al. 1999) or from a weakly- 
ionized plasma produced by laser-ablation of graphite in hydrogen (Scott & 
Duley 1996; Duley et al. 1998). 

1.8 Spectroscopy of Dust: Ice Features 

In dark molecular clouds a number of additional absorption features are seen, 
most notably a strong band at 3.1 /xm which is attributed to the 0-H stretch- 
ing mode in H2O ice. However, the 3.1 /xm feature is not seen on sightlines 
which are confined to diffuse interstellar clouds (Gillett, Jones et al. 1975). 

When a strong 3.1 /xm feature appears in absorption, a number of other 
absorption features are also seen, including features due to CO (4.67 /xm), 
CH3OH (3.53 /xm), and CO2 (15.2 ^m). The shape of the 3.1 /xm H2O feature 
is indicative of the type of ice and the impurities present in it. The relative 
strengths of the various features indicate that H2O is the dominant “ice” 
species, with NH3, CO, CH3OH, and CO2 as secondary constituents. 

1.9 Spectroscopy of Dust: PAH Emission Features 

A wide variety of galactic objects, including planetary nebulae, HII regions, 
photo-dissociation fronts, and reflection nebulae, have strong infrared emis- 
sion in the 3-13 /xm region. Most of the radiated power emerges in 5 broad 
infrared bands, at 3.3, 6.2, 7.7, 8.6, and 11.3/xm. In Fig. 6 we show the 4- 
15 /xm emission observed from the reflection nebula NCC 7023 (Cesarsky et 
al. 1996). These features are seen in many other objects as well (see, e.g., 
Boulanger et al. 1998). The emission is quite strong: a significant fraction of 
the starlight energy incident on the reflection nebula is re-radiated in these 




Fig. 6. Emission spectrum of the reflection nebula NGC 7023 measured by ISO- 
CAM (Cesarsky et al. 1996). 
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Fig. 7. Three examples of PAH molecules. 



infrared emission bands - the fraction can be as large as 10-20% depending 
on the spectral type of the star - so the particles responsible for this emission 
must be quite abundant, as we will discuss further below. 

The emission is thought to result from vibrational modes of polycyclic 
aromatic hydrocarbon (PAH) molecules. PAHs can be thought of as a section 
of a graphite “sheet” with the C atoms in a 2-dimensional hexagonal lattice, 
with the H atoms attached to C atoms at the edges of the lattice. Three 
examples of PAHs are shown in Fig. 7. When H atoms are attached to the 
edge of an aromatic ring, 

• the C-H stretching mode tends to be at 3.3 ^m, 

• the C-H in-plane bending mode is near 8.6 /xm 

• the C-H out-of-plane bending mode tends to fall at 11.3, 11.9, 12.7, or 
13.6 /im, depending on whether there are one, two, three, or four adjacent 
C-H bonds (“solo-”, “duo-”, “trio-”, or “quartet-” C-H). The observed 
spectra suggest approximately equal fractions of solo, duo, or trio C- 
H, and little quartet C-H, consistent with what one would expect for a 
mixture of large symmetric compact PAHs (Stein & Brown 1991). 

• The carbon skeleton has C-C-C bending modes near 6.2 and 7.7 /xm 

The precise position of the bands, and their relative strengths, vary from 
one individual PAH to another, and upon their state of ionization, but the 
observed spectra are in good agreement with what one might obtain from 
a mixture of neutral and ionized PAHs (Allamandola, Hudgins, & Sandford 
1999). 
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1.10 IR and FIR Emission 

The energy which grains absorb from starlight is re-radiated at longer wave- 
lengths, mainly into the far-infrared. We can estimate the emission spec- 
trum of the Galaxy using data from all-sky surveys by the IRAS and CORE 
satellites, and pointed observations by the Japanese IRTS satellite. At high 
galactic latitudes we know that most of the dust is illuminated by the local 
average starlight background. Theoretical studies of dust grain heating by 
starlight suggested that a ^ 0.01 /rm interstellar grains would be heated to 
temperatures 15K T ;$ 20K by the diffuse starlight (see, e.g., Draine & 
Lee 1984). 

Because there is little dust at high galactic latitudes, the infrared sur- 
face brightness is low, but from the correlation of infrared surface brightness 
with 21 cm emission one can extract the infrared emission per H atom. This 
has been done using photometry in the 4 IRAS photometric bands, the 9 
photometric bands of the DIRBE instrument on COBE, and using the spec- 
trophotometry by the FIRAS instrument on COBE. 

The Japanese IRTS satellite obtained the 3-13 /xm spectrum of the galactic 
plane (/ « 45°, b « 0). If we assume that the 3-13 /xm surface brightness is 
proportional to the 100 /xm surface brightness, we can estimate the 3-13 /xm 
emission per H atom. The results are shown in Fig. 8. The 3—13 /xm emission 
appears to be dominated by the same infrared emission features observed in 




Fig. 8. Observed infrared emission per H nucleon from dust heated by the aver- 
age starlight background (see text). Crosses: IRAS (Boulanger & Perault 1988); 
Squares: COBE-FIRAS (Wright et al. 1991); Diamonds: COBE-DIRBE (Arendt 
et al. 1998); Heavy Curve: IRTS (Onaka et al. 1996, Tanaka et al. 1996). The 
interpolated dotted line is used to estimate the total power. 
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reflection nebulae and planetary nebulae, and widely attributed to polycyclic 
aromatic hydrocarbons. 

If we interpolate to obtain the dotted line in Fig. 8, we can numerically 
integrate to obtain the total infrared emission power per H nucleon = 5.0 x 
10“^^ ergs“^H“^. About 2/3 of this power is radiated at wavelengths A > 
50 /im, which is where interstellar dust at T 20 K was expected to radiate. 
However, about 1/3 of the power is radiated at A < 50 ^m, where the emission 
from T < 20K dust should be negligible. Furthermore, about 1/6 of the 
total power is radiated at A < 10 /rm, primarily in 6.6, 7.7, and 8.6 /rm PAH 
features. 



1.11 Interstellar Depletions: Atoms Missing from the Gas 

Narrow optical and ultraviolet absorption lines seen in absorption in stellar 
spectra can be used to determine interstellar gas phase abundances of many 
elements. While some species, such as N and S, have gas phase abundances 
(relative to H) which are approximately solar, certain others, such as Mg, Al, 
Si, Ti, Ca, Fe, Ni, Cr show abundances which are far below solar. Since we 
presume that the interstellar abundances are approximately solar (Sofia & 
Meyer 2001), the atoms missing from the gas must be locked up in dust grains. 
In Fig. 9 the abundances have been plotted as a function of “condensation 
temperature” - the temperature at which cooling, solar abundance gas at 
LTE would begin to condense a particular element into a solid or liquid phase. 
Elements with Tcond ~ 1200 K are able to form “refractory” solids with large 
binding energies, and it is therefore not surprising that the strongly depleted 
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elements tend to be those with high values of Tcond- The observed elemental 
depletions provide a clue to the composition of interstellar dust. Restricting 
ourselves to abundant elements, we see that the “electron donor” species in 
interstellar dust material must be predominantly C, Mg, Si, and Fe. The 
grains could contain a significant amount of O, and perhaps H. 

1.12 A Provisional Grain Model 

Below we will be discussing the astrophysics of interstellar dust. It is helpful 
to have a specific grain model in mind so that one can carry out calculations. 
Accordingly, here we describe briefly a grain model which appears to be 
consistent with observations. The model consists of a mixture of carbonaceous 
particles and silicate particles. The carbonaceous particles have the physical 
and optical properties of polycyclic aromatic hydrocarbon molecules when 
they are small a < 50 A, or Nq A. 6 x 10^ carbon atoms. When they are 
larger, the carbonaceous grains are assumed to have the optical properties 
of randomly-oriented graphite spheres. The silicate grains are assumed to be 
amorphous silicates. 

The emission in the 3 — 12 /xm region from the diffuse ISM is attributed 
to dust grains small enough that a single starlight photon can raise the vi- 
brational temperature high enough for thermal emission in the observed vi- 
brational modes. In order for this emission to amount for ~ 21% of the total 
radiated power, very large numbers of very small dust grains are required. 
Li & Draine (2001) estimate that ~15% of the total carbon abundance must 
be in carbonaceous particle containing < 10^ C atoms. Since these particles 
must account for ~20% of the total absorption of starlight, it is obvious that 
they must make a significant contribution to the interstellar extinction curve. 

Weingartner & Draine (2001a, hereafter WDOl) have shown that there are 
dust size distributions which include the required large numbers of ultra-small 
carbonaceous grains and which are compatible with the observed extinction 
curves for various regions in the Milky Way, LMC, and SMC galaxies. The 
size distributions are by no means unique - the observed extinction curve 
cannot be “inverted” - but they are nevertheless strongly constrained, so the 
WDOl size distributions are probably not too far from the truth. Figure 10 
shows a grain size distribution appropriate for diffuse clouds in the local Milky 
Way. The total mass in grains with a A 100 A is constrained by the observed 
extinction in the ultraviolet, but since these grains are in the “Rayleigh limit” 
the extinction is virtually independent of the actual sizes and numbers of 
these particles, provided only that the constraint on the total volume of 
these grains is satisfied. 

The size distribution for a A 15A carbonaceous grains is adjusted to make 
the predicted infrared emission agree with observations. The bimodal nature 
of the carbonaceous size distribution for a ^ lOOA is probably an artifact of 
the fitting procedure. 
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Fig. 10. WDOl size distribution for dust consistent with an Rv = 3.1 extinction 
curve for the local Milky Way. The quantity plotted - a^dn/da - is proportional to 
the dust volume per logarithmic size interval. 



1.13 Far-Infrared and Submm Opacities 



If dust grains in a cloud have a temperature T^, the emergent intensity is 



L = B.(Td) [1 - , 


(2) 


Td(A) = k(A) J pds , 


(3) 


where the dust opacity 




1.4„„m„/* da • 


(4) 



p is the total mass density (gas + dust), and we have assumed riHe/ttn = 0.1. 
Dust opacities k are given in Fig. 12; these opacities are also available at 
http : //www . astro . princeton . edu/ ~draine/ dust /dust . html. 

At long wavelengths the optical depth Td <C 1. If we can estimate the 
dust temperature, the cloud mass per area can be obtained from the observed 
intensity Ii, by 

The derived mass density depends sensitively on the value of Td unless one 
is in the Rayleigh-Jeans limit, A ^ hc/kT^, in which case Jpds (x ■ 

2 Optics of Interstellar Dust Grains 

Our knowledge of interstellar dust is based in large part on the interaction 
of dust grains with electromagnetic radiation: absorption, scattering, and 
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Fig. 11. Dashed line: average diffuse cloud extinction curve. Solid lines: model ex- 
tinction curve for two extreme values of the ultra-small carbonaceous grain abun- 
dance: be = 0 (no PAHs) and be = 60 ppm C in PAHs. For each case the size 
distribution of carbonaceous and silicate grains was adjusted to obtain good overall 
agreement with the observed extinction. From Weingartner & Draine (2001a). 



emission. The interaction of electromagnetic radiation with a “target” will 
depend on the composition of the target (which determines the “response” of 
the target material to applied electric and magnetic fields) and the geometry 
of the target - both the shape and the size. Lacking definite knowledge of both 
the composition and the grain geometry, we must make some assumptions to 
proceed. 

2.1 Cross Sections, Scattering Matrices, and Efficiency Factors 

We are generally interested in calculating the following quantities for unpo- 
larized plane waves incident on the target: 

• C'abs = the total absorption cross section. 

• C'sea = the total scattering cross section. 

• dCaca/dfi = differential scattering cross section. This is related to the 
Muller matrix element S'!! by dCsca/df2 = Su/k'^, where k = 27t/A. 

• polarization P for the radiation scattered in a particular direction. 
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Fig. 12. Opacities k(A) = (dust absorption cross section) /(mass of gas + dust) for 
Milky Way dust with Rv = 3.1 and Rv = 5.5, LMC average dust, and dust in 
the SMC bar, computed for the carbonaceous/silicate dust model (Weingartner & 
Draine 2001a; Li & Draine 2001). 



In some cases we wish to explicitly consider polarized light. For this case 
the Stokes vector for the scattered radiation is obtained by multiplying the 
Stokes vector of the incident radiation by the 4x4 Mueller scattering ma- 
trix Sij. The 4x4 scattering matrix (which, for a fixed grain, is a function of 
direction of incidence and direction of scattering) fully characterizes the scat- 
tering properties of a grain. See Bohren & Huffman (1983) or Mishchencko, 
Hovenier, & Travis (2000) for discussions of scattering concepts. 

It is convenient to normalize scattering and absorption cross sections by 
dividing them by a geometric cross section. In the case of spherical target 
of radius a, the standard convention is to simply divide by the geometric 
cross section tto^. In the case of non-spherical targets, there is more than one 
convention in use. Some authors divide by the geometric cross section of the 
target as seen from the direction of the illumination. Other authors divide 
by the average geometric cross section for random orientations. 

My preference is to normalize by dividing by the geometric cross section 
of an “equal volume sphere”. Thus for a target of mass M, and made of a 
material with density p, the volume is H = M/p, and we define “efficiency 
factors t^scaj l^absj Qext = Qsca l^abs by 



Q 



c 



where 



^eff — 




(6) 
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2.2 Grain Geometry? 

Since interstellar dust is able to polarize starlight, we know that at least 
some interstellar dust grains must be appreciably non-spherical. Neverthe- 
less, for many purposes we will approximate dust grains as spheres for the 
simple reason that we can use “Mie theory” to calculate scattering and ab- 
sorption of light by spherically-symmetric targets with sizes comparable to 
the wavelength. Exact series solutions have been found for spheroids (Asano 
& Yamamoto 1975, Asano & Sato 1980; Voshchinnikov & Farafonov 1993) 
but numerical calculations are much more challenging than for spheres. So- 
lutions for other shapes (other than the unphysical case of infinite cylinders) 
are not available. 

2.3 Dielectric Functions 

First, a few words about conventions: We will follow c.g.s. electromagnetism, 
so that e = /X = 1 for the vacuum. We will represent plane waves as propor- 
tional to with this convention, absorption corresponds to Im(e) > 0. 

When an electric field is applied to a material, there are two distinct types 
of response possible: the response of the hound charge and the free charge. 
The bound charge (e.g., electrons and nucleus within an atom) undergoes 
a finite displacement when the electric field is applied, with the individual 
atoms or molecules acquiring an electric dipole moment in response to the 
applied electric field. The response of the bound charge is characterized by a 
complex dielectric function (w). 

The free charge responds in the form of an electric current density. 



J = aE 



( 7 ) 



where cr(w) is the electrical conductivity. 

When dealing with a monochromatic electric field it is convenient to define 
a dielectric function e describing the response of both bound and free charge: 

e = e(b°und)(^) + ^^^^(^) , (8) 

u 

With this definition, the free charge current J is now absorbed into the dis- 
placement current (l/47r)9Z?/9t, so that Maxwell’s equation becomes 



c ot 



(9) 



where D = eE. 

2.3.1 A Model Insulator 

It is instructive to consider a simple classical model for the dielectric re- 
sponse of an insulator. Suppose the insulator consists of “molecules”, with 
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no permanent dipole moment, but which can be polarized in response to 
the local applied electric field plus the electric field from all of the other 
polarized molecules. The dipole moment of a molecule is qx, where x is the 
displacement of the bound charge q. Suppose the displacement x behaves like 
a driven, damped harmonic oscillator. 



mx = qE — itiujqX — m — 
To 



(10) 



where ojq is the resonant frequency, and Tq is a damping time. For a periodic 
electric field E oc we can readily solve for the molecular polarizability 

a = qx/E\ 

. . ( 11 ) 



(w/wo)^ - *w/(wo'^o) 



where 



ao = 






mUJn 



(12) 



is the zero- frequency polarizability of the molecule. We now consider a medium 
consisting of a number density n of such molecules. Each now responds to 
the externally applied electric field plus the electric field due to all of the 
other polarized molecules. In the low frequency limit, the effective dielectric 
function e of the medium is related to the molecular polarizability through 
the famous Clausius-Mossotti relation (see, e.g., Jackson 1975) 



e = 1-k 



47ma 



1 — (47r/3)no; 



(13) 



For molecules on a cubic lattice, the Clausius-Mossotti relation (13) is exact 
in the limit \m\kd <C 1, where d = is the intermolecular spacing and 

k = Lojc. For finite fed, corrections to 0[{kd)^] have been derived by Draine 
& Goodman (1993). Substituting (11) into (13) we obtain 



e= 1-k 



Annao 



1 - (47r/3)nao - (<^/<^o)^ - fw/(wo'^o) 
In the low frequency limit w — >■ 0, we have 



Re(e(b°“d)) 



(bound) 

^0 



= i-k 



Airao 



1 — (47r/3)ao 



(14) 



(15) 



from which we note that 



47rofo = 3 



(bound) -I 

^0 ~ ^ 

(bound) 






-k2 



At low frequencies, the imaginary part of 
quency: 

Im(e(b°™d)^ « Aw 



(16) 

'd) varies linearly with fre- 
(17) 
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A = 



3 



(bound) 

^0 



^2 (bound) 

u eo 



+ 2 



(18) 



An insulator has no mobile charges (cr = 0), and for our model at low fre- 
quencies we have 



Re (e) — >■ = const , (19) 

Im (e) — >■ Aoj . (20) 



2.3.2 A Model Conductor 

For a material with a density Ug of free electrons, a simple classical model 
for the electron dynamics would be 



mgX = eE — rUe — 

Te 



(21) 



where Tg is the electron collision time. With this equation of motion, the 
conductivity cr(oj) = ngexjE is just 



cr(w) 



gp 

1 — i(jjTg 



(22) 



where 



UgC^Tg _ UJpTg 

nig 4 tt 



(23) 



where ctq is the d.c. conductivity, and tOp = (47rnee^/me)^/^ is the plasma 
frequency. For this case, the low-frequency behavior of e is 



Re (e) — >■ _ q-Tro-gTe = const , (24) 

Im (e) — >■ Auj H ^ . (25) 



Comparing (20) and (25), we see that the low-frequency behavior of Im(e) 
is qualitatively different for an insulator or for a conductor. Below we will see 
what this implies for the frequency-dependence of the absorption by small 
particles. 



2.4 Calculational Techniques 

In many astrophysical applications we are concerned with particles which are 
neither very small nor very large compared to the wavelength of the incident 
radiation. There are several different methods which can be used to calculate 
scattering and absorption cross-sections by targets which are neither very 
large nor very small compared to the wavelength: 
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• Mie theory solution for spherical targets (Bohren & Huffman 1983). Mie 
theory codes are highly-developed and readily available. They break down 
due to numerical roundoff error when the target becomes too large com- 
pared to the wavelength. A modified version of the Bohren & Huffman 
code is available at 

http : / /www . astro . princeton . edu/~draine/ scattering . html 

• Series solution for homogeneous spheroid or layered spheroid (boundaries 
must be confocal spheroids) (Asano & Yamamoto 1975; Asano & Sato 
1980; Voshchinnikov & Farafonov 1993). Codes for spheroids have been 
developed but have not been widely used. Codes based on the Asano 
& Yamamoto treatment had a reputation for being somewhat numeri- 
cally delicate (Rogers & Martin 1979). Voshchinnikov & Farafanov (1993) 
have developed a treatment based upon separation of variables which is 
reported to be more robust. They have recently generalized this to core- 
mantle spheroidal targets (Farafanov, Voshchinnikov, & Somsikov 1996). 

• Extended Boundary Condition Method (EBCM), often referred to as the 
“T-matrix method” (Mishchenko, Travis & Macke 2000). The EBCM 
method (Mischchenko, Travis & Macke 2000) is used to construct the “T 
matrix” which gives the coupling between vector spherical harmonic com- 
ponents of the incoming wave and vector spherical harmonic components 
of the scattered wave. Once the T-matrix has been obtained, orienta- 
tional averages can be efficiently calculated. The EBCM method appears 
to be well-suited to targets with rotational symmetry which are not ex- 
tremely elongated. EBCM T-matrix codes have been made available by 
M. Mishchenko at http : //www.giss .nasa.gov/^crmim/tjnatrix .html 

• Discrete Dipole Approximation (Draine & Flatau 1994; Draine 2000). The 
Discrete Dipole Approximation can be quite readily applied to complex 
geometries. It has been used to calculate the optical properties of graphite 
particles, with a highly anisotropic dielectric tensor (Draine 1988; Draine 
& Malhotra 1993). In recent years it has been accelerated by use of FFT 
techniques (Goodman, Draine, & Flatau 1991), and its accuracy has 
been improved by refinements in the assignment of dipole polarizabili- 
ties (Draine & Goodman 1993). The Fortran code DDSCAT is available 
from http://www.astro.princeton.edu/~draine/DDSCAT.html , and 
a detailed User Guide is available (Draine & Flatau 2000). 



2.5 Scattering by Homogeneous Isotropic Spheres 

The solution to Maxwell’s equations for a plane wave incident on a homo- 
geneous and isotropic sphere was independently obtained by Mie (1908) and 
Debye (1909). The solution - now generally referred to as “Mie theory” - 
is given in terms of a series expansion in powers of the “size parameter” 
X = 27ra/A. The expansion converges, but the number of terms which must 
be retained is of 0{x). The details of the Mie theory solution are nicely de- 
scribed in the excellent monograph by Bohren & Huffman (1983). There are 
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a number of computer codes to evaluate the absorption cross section and 
differential scattering cross sections given by Mie scattering theory. When x 
is large, care must be taken to avoid errors due to finite precision arithmetic 
in numerical evaluation of the series solution. 

For an incident monochromatic plane wave, the full Mie theory solution 
depends on the dielectric function e{uj) - describing the electric polarization 
of the material in response to an applied electric field oscillating at angular 
frequency oj - and the magnetic permeability function pl{ui) - characteriz- 
ing the magnetization of the material in response to an applied oscillating 
magnetic fields. Because the magnetic response is generally negligible (i.e., 
l/r — 1| <C |e — 1|) at frequencies above 100 GHz) it is customary to neglect 
the magnetization of the grain material except insofar as it is due to eddy 
currents. 



2.6 Discrete Dipole Approximation 

The discrete dipole approximation consists in replacing the (continuum) tar- 
get of specified dielectric function e with an array of polarizable points (re- 
ferred to as “dipoles”). The array geometry is chosen to mimic the target 
geometry. The polarizabilities a of the points are chosen so that an infinite 
lattice of such polarizable points would have the same dispersion relation as 
the material of dielectric function e (Draine & Goodman 1993). With FFT 
techniques employed to speed the calculation, it is now feasible to calculate 
scattering and absorption by targets represented by more than 100,000 po- 
larizable dipoles on a workstation with 256 MB of RAM. There are great 
efficiencies if the dipoles are situated on a cubic lattice, so DDSGAT requires 
that this be the case. 

The DDA can be applied to inhomogeneous targets and targets with com- 
plex geometries. As an example of the type of problem which can be solved, 
in Fig. 13 we show a discrete-dipole array of 61432 dipoles intended to ap- 
proximate a tetrahedral target. In Fig. 14 we show the calculated scattered 
intensity in the x-y plane as a function of scattering angle 0. Notice the pro- 
nounced scattering peak at 6> = 240° - this is the direction where one would 
have a specular reflection peak in the geometric optics limit. 

For a fixed target size D, complex refractive index m, and wavelength A, 
the DDA converges to the exact answer in the limit where the inter-dipole 
separation d — >• 0: the target structure is well-resolved and the dipole separa- 
tion is small compared to the wavelength in the target material. The criterion 
for separation small compared to A can be written \m\kd ^ 1. If |m| is not too 
large, we find that the overall scattering and absorption cross section is cal- 
culated fairly accurately provided \m\kd ^ 1; the detailed scattering pattern 
will be calculated accurately if \m\kd ^ 0.5. For example, the two calculations 
in Fig. 14, with \m\kd = 0.82 and 0.68, are already in fairly good agreement. 
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tetrahedron 

N=61432 

Fig. 13. Representation of a tetrahedron with a dipole array. The dipoles are ac- 
tually point-like - here they are represented by small spheres for purposes of vi- 
sualization. We consider incident radiation propagating along the direction of the 
vector kin illustrated here. Taken from Draine (2000). 

2.7 Infrared and Far-Infrared 

When the wavelength is long compared to the target size, we may use the 
“dipole approximation” (Draine & Lee 1984) and characterize the response 
of the target to the incident electromagnetic field purely in terms of the 
(oscillating) electric and magnetic dipole moments of the target. We may 
write 



C'abs = [Im(a|;) -b Im(a^)] , 


(26) 


Csca=y (^)"[|a|P+|aSn , 


(27) 



where a% is the (complex) electric polarizability along the direction of the 
incident electric field vector, and is the (complex) magnetic polarizability 
along the direction of the incident magnetic field vector. 

Consider a homogeneous ellipsoidal particle composed of an isotropic ma- 
terial, with semi-axes a, b, c. If the applied electric field is along one of its 
principal axes, the electric polarizability is 



47t (e — 1)Le + 1 



(28) 
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Fig. 14. S'!! = k^dCaca/dfl for scattering of unpolarized light by a tetrahedron, 
for scattering directions in the x-y plane (see Fig. 13), as a function of scattering 
angle 0. The tetrahedron has refractive index m — 1.33 + O.Oli, has edges of length 
s = 7.849A, where A is the wavelength of the incident light in vacuo, and is oriented 
with an angle 9 = 30° between kin and ai. The peak at 0 = 240° corresponds 
to the direction of specular reflection for geometric optics. Results are shown for 
a tetrahedron represented by = 61432 and N = 105832 dipoles. The scattered 
intensities are in excellent agreement in all directions where the scattering is at all 
strong. The dashed line shows k^{Zn) = Csca/dTr. Taken from Draine (2000). 



where Le is the “shape factor” along the direction of the applied E field, and 
V = Anabc/?) is the volume of the ellipsoid. The shape factors along the three 
principal axes satisfy the sum rule Li + L 2 + L 3 = 1 and may be obtained 
by numerical quadrature (Bohren & Huffman 1983). 

Consider now a spheroid with semi-axes a, b, b (prolate if a /6 > 1, oblate 
if a /6 < 1). For this case we have analytic expressions for the shape factors 
La and Lf, = (1 — La)j2. For a prolate spheroid (a < 6 ) we have (van de 



Hulst 1957) 





1 -b e 
1 — e 



1 



(29) 



and for an oblate spheroid (a < 6 ) we have 



L 



a 




- arctan(e) 



(30) 



where 



e^^ll 



(b/an 



(31) 
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Exact solutions for o;|; in the limit A 3> Oeff are also available for layered 
grains provided the interfaces are confocal spheroids (Gilra 1972; Draine & 
Lee 1984). 

For spheres we have La = Lb = 1/3, and 



^ _ 9loV £2 

^ (ei + 2)2 + £2 



(32) 



where £i = Re(£) and £2 = Im(£). 

Equation (32) contains an important result: the absorption cross section 
of a grain with size <C A is simply proportional to the grain volume! The far- 
infrared opacity therefore depends only on the total volume of grain material 
present, but not on the sizes of the particles, provided only that they are 
small compared to the wavelength A. 

How do we expect the absorption cross section to depend on frequency uj 
at low frequencies? For an insulator, from (20) we see that 



Gabs 



9V A 



c (cq -I- 2)^ 

while for a conductor, we see from (25) that 
Gabs ^ 

C \ dTTCTo 



OJ 



U! 



(33) 



(34) 



Thus we see that for both insulating grains and conducting grains, a simple 
physical model leads to Gabs oc at low frequencies! This is the basis for 
the expectation that dust opacities should vary as X~^ in the far-infrared, as 
in the dust models of Draine & Lee (1984). 

Now the above discussion has been based on simple classical models for 
the response of charge to applied electric fields. Real materials obey quantum 
mechanics, so results may differ. Furthermore, the above discussion assumed 
that w <C Wo in discussion of the response of the bound charge, but an amor- 
phous material could have very low frequency vibrational modes, so that the 
assumption that w <C wq might not be valid in the far-infrared. This might 
explain why some laboratory studies of amorphous materials find different be- 
havior: for example, Agladze et al. (1996) find that the absorption coefficient 
for amorphous Mg0-2Si02 varies as between 800 /rm and 4 mm. 

Nature must be our guide, but it is worth keeping in mind that opacities 
varying as emerge naturally from simple models. 



2.8 Kramers-Kronig Relations 

Purcell (1969) pointed out that the Kramers-Kronig relations can provide 
useful constraints in dust modelling. The Kramers-Kronig relations are gen- 
eral relations which apply to “linear response functions” , such as a dielectric 
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function, which specifies the response (e.g., the electric polarization) to an 
applied stress (e.g., an applied electric field). The only assumptions are that 
(1) the response is linear, and (2) the system is causal - the response can 
depend on the stress applied in the past, but cannot depend upon the future. 
With these very simple assumptions, it is possible to derive the Kramers- 
Kronig relations (see Landau & Lifshitz 1960 for a nice derivation). In the 
case of the dielectric function, the Kramers-Kronig relations are 



ei(wo) = 


1+ -P 




(35) 


7T 


Jo ^ 


£ 2 (^ 0 ) = 


2 / 

-( jJqP / 

■ 7 T /. 


-,.,2 ’ 


(36) 
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where P indicates that the principal value is to be taken. 

Thus the real and imaginary parts of e(w) are by no means independent 
- if either one is known at all frequencies, the other is fully determined. In 
order to have a physically acceptable dielectric function, one approach (see, 
e.g., Draine & Lee 1984) is to specify C 2 at all frequencies, and then construct 
ei(w) using (35). 



2.9 Kramers-Kronig Relations for the ISM 



Purcell showed that one could apply the Kramers-Kronig relations directly 
to the interstellar medium. Plane waves propagate through the interstellar 
medium but are attenuated by scattering and absorption. One can describe 
this attenuation by an imaginary component of the dielectric constant e of 
the interstellar medium, where |e— 1| <C 1. 

Recall that for an electromagnetic plane wave of frequency to, the wave 
vector is fc = m{uj)cijfc, where m(w) = ^/e is the complex refractive index. 
The electric field intensity decays as 



E oc exp [— Im(/c)a;] = exp 



1 

2 



Im(e')- 

c 



(37) 



The energy in the wave is proportional to \E\'^ and therefore decays twice as 
rapidly as E. Thus 

ngrOext(A) = ^ C2 . (38) 

We can now apply the Kramers-Kronig relation (35) to obtain the real 
part of the dielectric function e of the ISM at zero frequency: 

ei(0)-I=-/ — -ngrCext(w) . (39) 

The static dielectric function of the ISM is directly related to the electric 
polarizability Ogr of the grains in it: 



£l(0) — 1 = dTrUgr agr(O) 



(40) 
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Thus 
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(41) 


47rngrQ:gr(0) = — CUgr 
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/ dACext(A) . 


(42) 



0 



We have seen above how the static polarizability of a spheroidal grain de- 
pends on its volume, shape, and dielectric function. Averaged over random 
orientation, we have (for a dielectric grain) 



47ragr(0) = 3VF{a/b,€o) 



(43) 



F{a/b,€o) 



(eo - 1) 

3 



1 

(cq — l)3Ta -I- 3 
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(eq ~ 1)34^6 -l- 3 



(44) 



F is just the orientationally-averaged polarizability relative to the polariz- 
ability of an equal- volume conducting sphere. Using (43), we obtain the grain 
volume per H atom in terms of an integral over the extinction per H atom: 



rZgrU 

nn 
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"^Cext(A) 

nn 



(45) 



Now consider a conducting material. From (24-25) we see that e — >■ i x oo, 
so that 



1 2 



(46) 



so that a conducting sphere {La = Lh = 1/3) has F{a/b = l,eo = oo) = 1. 

We do not know the grain shape, and we do not know the static dielectric 
constant cq. However, from Fig. 15 we see that F < 1.5 unless the grain is 
extremely elongated (a/6 > 20 or 6/a > 20) and the static dielectric function 
is very large (eo > 10) - as for a metal. For a reasonable grain shape (e.g., 
0.5 < a/6 < 2) and dielectric function cq « 3 we have F « 0.4. 

Purcell’s analysis is a delight, and has two important consequences: 



2.9.1 Grain Volume per H 

It is of course not possible to measure the extinction per H atom at wave- 
lengths from 0 to oo. However, since Cext > 0, measurements over a finite 
wavelength range can be used to obtain a lower bound on FUngr/nu- The ex- 
tinction per H nucleon is fairly well-known from 0.1 /rm to 30 /im; a numerical 
evaluation gives 



^30 fim 



A^h 



dX 



1.1 X lO-^^cm^/H 



0.1 i-im 



(47) 
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10 



Fig. 15. Function F{a/b,e^) as a function of the static dielectric constant eo, for 
selected values of axial ratio a/b. After Purcell (1969). 

Approximately 50% of the integral is contributed by wavelengths 0.1 < A < 
1 fim. If we estimate F Ri 0.4 this gives 



or, for an assumed grain mass density p = 2.5gcm“^ (intermediate between 
graphite and silicate) we have a lower bound on the ratio of grain mass to H 
mass: 



2.9.2 Asymptotic Behavior at Long Wavelengths 

Equation (45) tells us that Jp°°Cext(A) d\ must be convergent, and therefore 
that Cext(A) must decline more rapidly than 1/A: grain models in which 
Cext oc 1/A for A — >■ 00 are unphysical. It is of course possible to have 
dlnCext/dlnA Ri — 1 over a limited range of wavelengths, but this cannot 
be the asymptotic behavior. 

2.10 Microwave 

Dust grains would be expected to radiate thermally at microwave frequencies, 
but this emission was expected to be quite weak, based on an extrapolation 
from the thermal emission peak at ~ 100 pm. It was therefore a surprise 
when sensitive maps of the microwave sky brightness (for the purpose of 
studying angular structure in the cosmic background radiation) showed rel- 
atively strong emission correlated with the Galactic 100 pm emission, and 




(48) 




(49) 
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Fig. 16. Microwave sky brightness correlated with 100 pm emission. Shown is the 
estimated microwave emissivity per unit H nucleon. 



therefore with interstellar dust (Kogut et al. 1996a, b). Figure 16 shows the 
inferred microwave emissivity of per H nucleon. The dashed line labelled 
“(3 = 1.7, Td = 19.5” is the contribution expected from thermal emission 
from dust grains if the dust opacity oc for A 100 pm. We see that at 
30 GHz (A = 1cm the observed microwave emission is two orders of magni- 
tude above the value expected from classical dust grains. 

The power radiated in the microwave is a tiny fraction of the total, so this 
microwave emission is not an important source of cooling. Nevertheless, we 
would like to know what process is responsible. Possible mechanisms include 
synchrotron emission from relativistic electrons, or free-free emission from 
a thermal plasma, but these seem unable to account for the observed 15 — 
50 GHz emission (see Draine & Lazarian 1999b). We are left with interstellar 
dust. 

The possibility of radio emission from rotating dust grains dates back 
to Erickson (1957). As discussed above, the strong infrared emission in the 
3—12 pm range appears to require a very large population of very small dust 
grains. Ferrara & Dettmar (1994) pointed out that these dust grains would 
produce detectable radio emission if they were undergoing Brownian rotation 
at the gas temperature. 

Draine & Lazarian ( 1998a,b) analyzed the grain rotational dynamics, and 
showed that the population of ultra-small grains required to understand the 
infrared emission would be expected to produce microwave emission, with 
a predicted spectrum (and intensity) as shown in Fig. 16. We will discuss 
the grain dynamics in §6, but this rotational emission still seems likely to 
contribute a substantial fraction of the observed dust-correlated microwave 
emission. In principle, these rapidly spinning dust grains could be aligned. 
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leading to polarized emission, but the polarization fraction has been esti- 
mated to be small (Lazarian & Draine 2000). 

However, if dust grains contain magnetic materials - which does not seem 
implausible, considering the large amount of Fe in interstellar grains - there 
could also be thermal magnetic dipole emission from dust. Consider a ferro- 
magnetic domain in a dust grain. This domain is spontaneously-magnetized: 
it is energetically favorable to have alignment of electron orbital angular mo- 
mentum and electron spins. The lowest energy state has some specific mag- 
netization, but there are nearby energy states where the domain has uniform 
magnetization with the same net magnetic moment but in a slightly differ- 
ent direction. In thermal equilibrium, it is possible for these higher energy 
states to be excited, so that the magnetization direction will fluctuate. This 
results in a time- varying magnetic dipole moment, and hence magnetic dipole 
radiation. 

The optics of magnetic dust grains with size small compared to the wave- 
length has been developed by Draine & Lazarian (1999a; hereafter DL99a), 
and the magnetic susceptibility was estimated for several magnetic ma- 
terials, including metallic Fe and magnetite Fe 304 . The spectrum and in- 
tensity of this radiation was estimated by DL99a, and it was found that 
if a substantial fraction of the Fe in interstellar grains is incorporated into 
magnetic materials (e.g., magnetite), the resulting thermal emission could 
account for an appreciable fraction of the observed microwave emission! 

How will we be able to distinguish between spinning dust grains and 
magnetic dust grains as sources of microwave emission? DL99a point out 
that small grains are believed to be under-abundant in dense clouds (they 
presumably coagulate to form bigger grains), so that the rotational emissivity 
per H nucleon would be reduced. The magnetic dipole emission from magnetic 
grain materials does not, however, depend on the grain size. Hence if we see 
strong microwave emission from dust in dense clouds, this would suggest 
that magnetic dust grains may be responsible. Future pointed observations 
of dense clouds may answer this question. 

2.11 X-rays 

The scattering of X-rays by dust grains was first discussed by Overbeck 
(1965), Slysh (1969), and Hayakawa (1970). Because the refractive index of 
grain materials is close to unity at X-ray energies, the scattering is through 
small angles. As a result, images of X-ray point sources show a “halo” of 
dust-scattered X-rays. The observable halo can extend for tens of arcminutes 
from the source. 

For sufficiently small grains, this scattering can be calculated using the 
Rayleigh-Gans approximation (see Bohren & Huffman 1983), which is valid 
if |m — \\ka <C 1, where a is the grain radius. Because X-rays have large 
k, this condition may not be satisfied for the larger interstellar grains, in 
which case one should resort to full Mie theory. This can be numerically 
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challenging because the number of terms which must be retained is of or- 
der ka « 507(a/0.1 /im)(i?/keV), so high numerical accuracy is required. 
Smith & Dwek (1998) have compared scattering halos estimated using the 
Rayleigh-Gans approximation with Mie theory, and show that the Rayleigh- 
Gans approximation fails for energies below 1 keV. Scattering by grains with 
|to — l|fca <C 1 and ka ^ 1 can be treated using “anomalous diffraction 
theory” (van de Hulst 1957). 

Nova Gygni 1992, a bright X-ray nova, was observed by ROSAT. The 
observed X-ray halo has been compared to predictions for different grain 
models by Mathis et al. (1995), Smith & Dwek (1998), and Witt, Smith & 
Dwek (2001), with differing conclusions. The most recent study (Draine & 
Tan 2002) concludes that our standard dust model is in reasonable agreement 
with the observed X-ray halo. Future observations of X-ray scattering halos 
will be valuable to test and constrain grain models. 



3 IR and Far-IR Emission from Interstellar Dust 



3.1 Heating of Interstellar Dust 



While dust grains can be heated by collisions with gas atoms and molecules, 
starlight usually is the dominant heating process. The rate of energy depo- 
sition, the rate of photon absorptions, and the mean energy per absorbed 
photon are 




dv U^cCe,hs{v) 



(50) 



f^abs 




e Gabs (b') 



(51) 
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Nabs 
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(52) 



where dv is the photon energy density in {v, v + dv), and Gabs(^^) is the 
photo-absorption cross section. We assume that all of the energy of an ab- 
sorbed photon is converted to heat; this is not exactly correct, since energetic 
photons may eject a photo-electron, or excite fluorescence, but these processes 
take away only a small fraction of the total absorbed power. 

Now suppose that the dust grain has energy E distributed among its 
vibrational degrees of freedom. Except when the energy E is very small, the 
number of vibrational modes which can be excited is very large. The density 
of states has been discussed by Draine & Li (2001): even for a small molecule 
like coronene G 24 H 12 , there are ^ 10 ^° different states with total energy 
E < leV; for G 4000 H 1000 this number becomes ~ 10^^®. As a result, the 
statistical notion of “temperature” can be used, even for quite small grains: 
a grain with vibrational energy content E is assumed to be characterized by 
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a temperature T{E) such that if the grain were in contact with a heat bath 
at temperature T, the expectation value for its energy would be E. 

At temperature T, the average power radiated by a grain is 




dvCa^hsiv) ATTBy{T) 



(53) 



where Cahs{v) is the angle-averaged absorption cross section for photons of 



frequency v, and 



B.{T) 
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(54) 



is the Planck function. 

It is now natural to determine the “steady-state” temperature Tgs at which 
the power radiated equals the power absorbed: 



dvC^hs{v) 47tB^(Tss) 




(55) 



Once we solve for Tgg, we calculate the thermal energy content of the grain 
at this temperature, E(Tss)- There are two regimes: 



• If E(Tss) ^ {hv)ahs, then individual photon absorption events do not 
substantially change the energy content of the grain, and we may assume 
that the grain temperature T = Tss- This is the “steady heating” regime, 
where we can approximate the discrete heating events as a continuous 
process. 

• If E{Tss) <C (/iJ/)abs, then individual photon absorption events heat the 
grain up to peak temperatures T^ax ^ Tss, and the grain will usually cool 
to a temperature Tmin < Tss before the next photon absorption event. 
This is the “stochastic heating” regime, where the discrete nature of the 
heating has important consequences. As we will see below, stochastic 
heating is often important. 



Figure 17 shows the temperature history of 4 grains of different sizes in the 
diffuse interstellar medium over a ~ 10® s interval - about one day. The 
temperature of the a = 200A grain fluctuates in a small range around ~ 20 K. 
However, as the grain size is reduced, the fluctuations become increasingly 
extreme, with the peaks being higher and the low points being lower. The 
smallest grain shown (a = 25A) absorbs about 1 starlight photon per day; this 
grain spends most of its time quite cool, but immediately following absorption 
of a photon the grain temperature reaches ~ 50K. The grain cools by infrared 
emission; it is obvious that most of the infrared emission must take place 
during the brief interval while the grain is “hot”. If we wish to calculate 
the time-averaged emission spectrum, we cannot use the “average” grain 
temperature - we need to integrate over a distribution of grain temperatures. 
This phenomenon is even more pronounced for smaller grains. A grain with 
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Fig. 17. A day in the life of an interstellar grain: grain temperature vs. time for 
4 grain sizes, for grains heated by the average interstellar radiation held. Grains 
with a ^ 200A have a nearly constant temperature, but a A. lOOA grains show con- 
spicuous increases in temperature following each photon absorption, with gradual 
cooling between photon absorption events. 



a radius of SA has a volume only 1/125 that of the 25A shown in Fig. 17; 
the mean time between photon absorptions is of order ~ 10^ s, but a 10 eV 
starlight photon can raise the grain temperature to ~ 10^ K. 

Let P{E) be the probability of the grain having vibrational energy E' > 
E. In the steady heating regime, we approximate P{E) as a step function, 
and \dP/dE\ as a delta function 6{E — Ess)- 

In the stochastic heating regime, we must solve for P{E). Draine & Li 
(2001a) show examples of energy distribution functions. If the energy states 
are grouped into bins j = 0, 1, . . . (where j = 0 is the ground state), then 
we can calculate the state-to-state transition rates Tji for transitions i ^ j 
due to both photon absorptions and photon emissions. Once Ti^j is known, 
we define the diagonal elements Tu = — The steady state solution 

Pj for the probability of being in state j then satisfies the N coupled linear 
equations 

M 

Q = ^TijPj forf = 0,...,M 
1=0 



(56) 
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Using the normalization condition ~ obtain a set of M linear 

equation for the first M elements of Pj: 



M-l 

{Tij - PM)Pj = -TiM for i = 0, . . . , M - 1 , (57) 

j=o 

which we solve using standard techniques. In practice, we take M « 500, in 
which case iterative methods are required to efficiently solve for Pj . Once the 
Pj are determined for j = 0, . . . , M — 1 we could obtain Pm by subtraction 
Pm = 1 — but this is inaccurate; it is better to instead use 



Pm = 



-1 

Tmm 



M-l 



3=0 



(58) 



Figure 18 shows dP/dlnE for carbonaceous grains heated by the average 
interstellar radiation field (ISRF) due to starlight in an HI region, as esti- 
mated by Mathis, Mezger, & Panagia (1983, hereafter MMP). For a ^ 25A 
grains dP/dlnE becomes very small for E/hc > (l/91lA) = 1.1 x 10® cm“^: 
this reflects the fact that grains cool essentially completely between photon 
absorption events, so that the energy content virtually never rises above the 
maximum energy (13.6 eV) of the illuminating photons. As the grain size 
increases, the time between photo-absorptions goes down, and the cooling 
time at fixed energy goes up; when the grain size exceeds ~ 30A there is a 
significant probability of a photo-absorption taking place before the energy of 
the previous photo-absorption has been radiated away. However, stochastic 
heating effects are noticeable even for grains as large as 300A - the energy dis- 
tribution function has narrowed considerably, but is still appreciably broad. 



3.2 IR and Far-IR Emission Spectrum 

With the energy distribution function calculated as discussed above, we can 
now calculate the time-averaged emission spectrum for a carbonaceous grain: 

Px = ATTjdE — Cabs(A) Bx{T{E)) . (59) 

In Fig. 19 we show the emission spectrum of PAH"*' molecules of various sizes 
heated by the ISRF, and in Fig. 20 we compare the emission spectrum calcu- 
lated using the energy distribution functions dP/dlnE with emission spectra 
calculated for dust with steady temperature Tg^. We see that stochastic heat- 
ing is important even for grains as large as ~ lOOA in the ISRF. 

In Fig. 21 we show the emission spectrum calculated for a mixture of 
carbonaceous and silicate grains with size distributions consistent with the 
average (Ry = 3.1) interstellar extinction curve, illuminated by the local 
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Fig. 18. Energy distribution functions for charged carbonaceous grains with radii 
a = 5, 10, 25, 50, 75, 100, 150, 200, 250, 300A in the interstellar radiation field. The 
discontinuity in the 5, 10, and 25A curves is an artifact due to a change in the 
method of estimating the cooling when the energy is equal to the 20th vibrational 
mode. For 5, 10, and 25A, a dot indicates the first excited state, and the probability 
Po of being in the vibrational ground state is given. Taken from Li & Draine (2001). 



ISRF. This should be the emission spectrum of the “cirrus” clouds. Also 
shown in Fig. 21 are observations of the emissivity per H atom of the dust 
at high galactic latitudes. 

Diamonds show DIRBE photometry; the small triangles show our calcu- 
lated spectrum convolved with the DIRBE filters, for comparison with the 
diamonds. The squares show the FIRAS determination of the emissivity per 
H nucleon. The far-infrared emission is in excellent agreement with the pre- 
dictions of our dust model. The model is in very good agreement with DIRBE 
photometry at 3.5 /im and 25 /xm; the model appears too low by about a fac- 
tor 2 at 5 and 12 /xm. The observations are difficult; it is hoped that SIRTF 
will be able to measure the spectrum of selected cirrus clouds for comparison 
with our model. 

In Fig. 22 we compare our model with observations taken on the galac- 
tic plane, where the FIR surface brightness is much higher. In addition to 
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Fig. 19. IR emissivity per C atom for PAH+ molecules of various sizes illuminated 
by the average ISRF. Solid line: “exact-statistical” calculation, which uses tran- 
sition probabilities which do not involve the thermal approximation. Dotted line: 
“thermal-discrete” calculation where the spontaneous emission rates are calculated 
using a thermal approximation, as discussed in the text. The two methods are 
in excellent agreement, indicating that the thermal approximation can be used to 
calculate the transition rates. Taken from Draine & Li (2001). 



DIRBE photometry, spectroscopic observations made with the IRTS satellite 
are available. 

These observations are looking in the galactic plane at the “tangent” point 
of gas in the 6kpc ring, at a distance of 6kpc from us. At this distance, the 
40' DIRBE beam subtends 70 pc, so the observations sum the emission from 
dust in a region ~ 70 x 70 pc on a side, and many kpc long. We compare 
the observations to a model which is obviously oversimplified: a uniform slab 
of dust heated by a uniform radiation field with the spectrum of the local 
ISRF but whose intensity we scale so that the dust emission best reproduces 
the observed spectrum: the best fit is obtained with the starlight intensity 
equal to twice the ISRF. The DIRBE beam obviously includes emission from 
stars, which we believe dominate at A < 4 /xm, so our model includes light 
from stars assumed to be mixed uniformly with the dust. This simple model 
is in good agreement with the observations. In particular, we successfully 
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Fig. 20. IR emission per grain for carbonaceous grains of various sizes illuminated 
by the average ISRF. Solid line: full stochastic heating calculation. Broken line: 
assuming steady-state temperature Tag. For a ^ 200A in the ISRF we see that 
stochastic heating has little effect on the emission spectrum, but for a A 150A it 
signihcantly modifies the emission spectrum. Taken from Li & Draine (2001). 



reproduce the very strong emission in the 5—12 /xm region. The discrepancy at 
100 /im is somewhat surprising, but may in part be due to oversimplification 
of our model, and perhaps in part to uncertainties in the DIRBE absolute 
calibration. 



3.3 The Small Magellanic Cloud (SMC) 

How successful is this dust model in reproducing the emission observed from 
other galaxies? The SMC, with metallicity ^ 10% of solar (Dufour 1984), 
and dust-to-gas ratio ~ 10% of the Milky Way (Bouchet et al. 1985) is a 
good test case. Li & Draine (2002) sum over both quiescent medium and 
regions of active star formation in the SMC, find that to match the observed 
spectrum they require a distribution of starlight intensities. Following Dale 
et al. (2001) they adopt a power-law distribution of starlight intensities. 

The dust extinction in the SMC is known to differ from Milky Way dust, 
primarily in the absence of a 2175A extinction bump on most (but not all) 
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Fig. 21. IR emission per H for dust mixture illuminated by the ISRF. Diamonds: 
DIRBE (Arendt et al 1998). Squares: FIRAS (Finkbeiner et al. 1999). From Li & 
Draine (2001). 



SMC sightlines (Gordon & Clayton 1998). Since we take the view that the 
2175 A feature is due to PAHs, the absence of this feature implies a low PAH 
abundance. We adopt the dust mixture of Weingartner & Draine (2001a), 
which reproduces the measured SMC bar extinction curve. 

With suitable choice of illuminating radiation field, the dust model is able 
to reproduce the observed IR emission from the SMC. It appears that the 
carbonaceous/silicate grain model (Weingartner & Draine 2001a; Li & Draine 
2001) can reproduce both the observed interstellar extinction by dust and the 
observed IR/FIR emission from dust. 

4 Charging of Interstellar Dust 

Why do we care about the charging of dust grains? There are a number of 
reasons: 

• Charged dust grains are coupled to the magnetic field, whereas neutral 
grains are not. This is important not only for the motions of the dust 
grains, but also as a mechanism for coupling magnetic fields to neutral 
gas. 
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Fig. 22. IR surface brightness toward (a) I = 44°20', b = — 0°20'; (b) I = 47°45'. Di- 
amonds: DIRBE photometry. Thin line, upper panel: 5—12 /rm spectrum measured 
by MIRS on IRTS (Onaka et al. 1996). Thin line, lower panel: 2.8 — 3.9 /rm spectrum 
measured by NIRS on IRTS (Tanaka et al. 1996). Solid line: carbonaceous-silicate 
dust model illuminated by 2xISRF, plus reddened starlight. From Li & Draine 
( 2001 ). 



• Charged dust grains undergo stronger gas drag due to Coulomb interac- 
tion with ions in the gas. 

• The photoelectric charging process injects energetic photoelectrons into 
the gas, which is a major mechanism for heating interstellar gas. 

• Neutral and negatively-charged dust grains can play an important role 
in neutralization of ions. 

• In cold, dense regions, the dust grains may “lock up” a substantial frac- 
tion of the electrons. 



4.1 Collisions with Electrons and Ions 

Consider a spherical grain of radius a and charge Zge, and a charged particle 
of charge ze, kinetic energy E, on a trajectory with impact parameter b. If the 
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Fig. 23. Comparison between dust model {solid line) and the observed spectrum of 
the SMC obtained by COBE/DIRBE (diamonds) and IRAS (squares) averaged over 
a 6.25 deg^ region including the optical bar and the Eastern Wing (Stanimirovic et 
al 2000). Model spectrum includes infrared emission from dust heated by a range 
of starlight intensities, plus starlight, which dominates at A 7 fim. Triangles show 
the model spectrum convolved with the DIRBE filters. Taken from Li & Draine 
( 2002 ). 



interaction potential is taken to be a Coulomb potential, then conservation of 
energy and angular momentum allow one to find the critical impact parameter 
6crit for the particle to just graze the grain surface. Trajectories with b < 6crit 
will collide with the grain, and trajectories with b > bent will miss. The 
collision cross section is simply 






= TTO 
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zeU 



for E > zeU 



= 0 



for E < zeU 
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where U = Z^eja is the electrostatic potential at the grain surface. If the 
velocity distribution is thermal, then the rate at which the charged particles 
reach the grain surface is just 



dN 2 
— — = mra 
dt 




exp(-f|f) 



for zeU > 0 
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for zeU < 0 
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This is a wonderfully simple result. If the sphere is located in a plasma of 
electrons and ions, and the only charging process is collisional, then the sphere 
will become negatively charged since the electrons move more rapidly than 
the ions. The steady-state charge is such that suppression of the electron 
arrival rate, and enhancement of the ion arrival rate, make them equal: 
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where we have assumed Zj = -|-1. Taking logarithms of both sides we obtain 
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(64) 



with the solution ell/kT = —2.51 for a hydrogen plasma {mi /me = 1836) 
with Si = Se- 

However, the Coulomb potential does not fully describe the interaction be- 
tween a charged particle and a grain. In the case of a neutral grain, the electric 
field of the approaching charged particle will polarize the grain, resulting in an 
attractive potential - the same effect that causes ion-neutral scattering rates 
to be large, even at low temperatures.^ Draine & Sutin (1987) have calculated 
collision rates including the image potential. Image potential effects are im- 
portant for neutral grains when e^/a kT, or aT ^ 1.67 x 10“^ cmK. Thus it 
is an important correction for small grains in cold gas (e.g., a ^ 1.67 x 10“^ cm 
for T= 100 K). 

The “sticking coefficients” for electrons and small ions are not well known. 
In the case of an ion with an ionization potential larger than the work function 
for the grain material, it seems likely that the ion, upon arrival at the grain 
surface, will either remain stuck to the grain or will strip an electron from the 
grain and depart; in either case the ion “sticking coefficient” s/ = 1. The case 
of impinging electrons is less clear. The electron might be elastically reflected 
from the grain surface, or the electron might pass through a small grain and 
out the other side without energy loss, in which case the electron would have 
enough energy to return to infinity. 

Experimental sticking coefficients for small neutral molecules or molec- 
ular ions can be estimated by dividing the measured electron capture rate 
coefficient by the estimated rate at which the electrons would reach the sur- 
face of the ion. Figures 24 and 25 show the electron sticking coefficients for 
small neutral and charged carbonaceous clusters containing 10 < Nq < 100 
carbon atoms. 

In hot gas, it is possible for impinging electrons to eject an electron from 
the grain. For molecules, this is called “ionization”; for larger objects, it is 
called “secondary electron emission”. It is therefore possible for grains to 

^ This is just the familiar “image potential” from electrostatics for a charge near 
a conductor. 
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Fig. 24. Electron sticking coefficient for small neutral grains. From Weingartner & 
Draine (2001c). 



become positively charged by collisions in a hot (T 10® K) plasma (Draine 
& Salpeter 1979a). 

4.2 Photoelectric Emission 

The electrons in a neutral grain can be thought of a being confined within 
a potential well, as shown in Fig. 26. The potential well is produced by a 
“double layer” of charge at the boundary of the grain (the electron charge 
density extends beyond the ion charge density, resulting in a region near the 
surface with a radial electric field - see standard texts on solid state physics, 
e.g., Ashcroft & Mermin 1976). 

Three cases are shown: 

• For a grain with Z > 0, the excited electron will be subject to a long-range 
Coulomb attraction by the rest of the charge. In this case the electron 
which physically gets outside of the grain must still have additional energy 
in order to reach oo. 

• For a grain with charge Z = —1, the excited electron sees a system with 
zero net charge, so once the electron is physically outside the double layer, 
it will continue to infinity. 
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Fig. 25. Electron sticking coefficient for small grains with charge Z = +1. From 
Weingartner & Draine (2001c). 



• For a grain with charge Z < —1, an electron which gets outside the 
double layer will be accelerated away from the grain. An excited electron 
can in principle penetrate the double layer by tunneling, so that it is not 
actually necessary for the electron to have an energy higher than the top 
of the double layer. 

An insulating grain has all the “valence band” energy levels occupied when it 
is neutral. The difference in energy between the top of the valence band and 
the first available vacant energy level (in the “conduction band”) is referred 
to as the “band gap”. If an insulating grain is to be negatively charged, 
the excess electrons must occupy available energy levels in the conduction 
band. Less energy is required to photo-eject one of these “excess” electrons 
(a process known as “photo-detachment”) than to eject an electron from the 
top of the valence band. 

When an ultraviolet photon is absorbed in a dust grain, it raises an elec- 
tron to an excited state. If the electron has an energy high enough to reach 
infinity, and does not lose this energy to inelastic scattering, it will escape 
and be counted as a “photo-electron” . To calculate the rate of photoelectric 
charging of a grain, we require the “photoelectric yield” Y(hv^ Z, a), which is 
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IP(-1) = EA(0) 



Z = -1 









IPv(-l)=W-|e. + q.s. 







Fig. 26. Model for the potential confining electrons in a grain with charge Ze. 
Shaded regions show occupied energy levels. W is the work function for bulk ma- 
terial; IP(Z) is the ionization potential; EA{Z) is the electron affinity; hupet is the 
photoelectric threshold energy; hup^t is the photo-detachment threshold energy (the 
minimum photon energy required to detach an electron from a negatively charged 
grain); IPv{Z) is the energy difference between infinity and the top of the valence 
band; Fbg is the bandgap energy for the material (Fbg = 0 for a metal). Taken 
from Weingartner & Draine (2001c). 



the probability that absorption of a photon hv will produce a photo-electron. 
To calculate the rate of photoelectric heating of the gas, we also need to 
know the energy distribution function f{E, hv, Z, a) of the energy E of the 
photoelectrons leaving the grain. 

The photoelectric ejection process consists of four stages: 

1. Excitation of an electron of appropriate energy. 

2. Motion of the electron from the point of excitation to the grain surface, 

3. Penetration of the surface layer and overcoming of the image potential. 

4. Once outside the grain surface, the electron must overcome the Coulomb 
potential (if the grain is now positively charged) to reach infinity. 

Weingartner & Draine (2001c) have estimated the photoelectric yield for 
small grains, both neutral and charged, taking into account the above effects. 
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hi/(eV) 

Fig. 27. Photoelectric yield for neutral coronene, as measured by Verstraete et 
al. (1990) and as estimated in the approach of Weingartner & Draine (2001c). 
Equations 11, 12, 13, 16 refer to equations in Weingartner & Draine (2001c), from 
which this figure is taken. 



Allowance is made for dependence of the image potential on the grain radius 
a, and the possibility of tunneling through the double layer in the case of a 
negatively charged grain. 

Watson (1973) pointed out that very small particles should have en- 
hanced photoelectric yields because of step 2 above: in a small particle, 
photo-excitations will on average be closer to the grain surface, increasing 
the probability that the excited electron will reach the grain surface with- 
out energy loss. Draine (1978) proposed a simple function to estimate the 
magnitude of this yield enhancement, which we use: 



/ — 2 q ;-|-2 — 2 exp(— a) 
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where la = A/[47rIm(TO)] is the photon attenuation length in the material, 
and le, the “electron escape length”, is essentially the mean free path against 
energy loss to inelastic scattering for an excited electron. In the limit a <C 1 
we have 



yi 



a 

P 



la le 



(67) 



Photon attenuation lengths la ~ 300A are typical in the vacuum ultra- 
violet. Martin et al. (1987) report le ~ 9 A for 6eV electrons in thin carbon 
films, and McFeely et al. (1990) report le ~ 6A for 8eV electrons in Si02. 
Weingartner & Draine (2001c) adopt le = 10 A, independent of energy, for 
both graphite and silicate grains. Since le «C la, it is clear that very large 
yield enhancement factors yi are possible for very small grains. 

Weingartner & Draine write 



Y {hv, Z, a) = min [t/q x lji{hv, a)] x y 2 {hv, Z, a) (68) 



where j /2 is the fraction of electrons which, having crossed the grain surface, 
have sufficient energy to overcome the long-range coulomb attraction if Z > 0 
(for Z < 0, 7/2 = !)• For PAHs and larger carbonaceous grains, yo is chosen 
so that Y = yoyiy 2 approximates the photoionization yield measured for 
coronene. 

Figure 28 shows the photoelectric yields estimated for neutral carbona- 
ceous and silicate grains of various radii. 



4.3 Charge Distribution Functions 

Figure 29 shows grain charge distribution functions calculated for dust in a 
molecular cloud region. The H nucleon density is taken to be nn = 300 cm“^, 
the gas temperature T = 25K, and the fractional ionization Ue/n^ = 10“^. 
The radiation field is assumed to have the spectrum of the MMP ISRF, but 
with intensity a factor of 10 below the MMP value. Under these conditions 
small grains (a A 200A) remain negatively charged. For larger grains, the 
reduced importance of “image charge” effects (as well as a slight reduction 
in the work function) tilts the balance in favor of photoelectric charging, but 
the grain potentials remain very small. 

Figure 30 shows grain charge distribution functions calculated for a dif- 
fuse region with starlight equal to the MMP ISRF. The electron density 
He = 0.03 cm“^ is taken to be the same as for the molecular cloud region 
of Fig. 29, but the increased starlight intensity allows photoelectric emis- 
sion to dominate the charging for grains with a A 25A. function The “CNM” 
conditions are appropriate to diffuse HI at 100 K. Despite the presence of ul- 
traviolet radiation producing photoelectric emission, note that an appreciable 
fraction of the small grains can be neutral or even negatively charged. 

We see that the competition between collisional charging and photoelec- 
tric charging can go either way, depending on the grain composition, the 
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Fig. 28. Photoelectric yields (photoelectrons per absorbed photon) for neutral 
graphite and silicate grains as a function of incident photon energy hv, for selected 
values of the grain radius a. From Weingartner & Draine (2001c). 



grain size a, the electron density rig and temperature T, and the spectrum 
and intensity of the ultraviolet background due to starlight. 

5 Dynamics of Interstellar Dust 

What are the velocities of interstellar grains? To answer this question, we 
must understand the forces acting to accelerate or decelerate the grains. 

5.1 Gas Drag 

If the grain is moving relative to the gas, it will experience a drag force 
as momentum is transferred from the grain to the gas. This drag force is 
approximately Tdrag = MgrUgr/rgr where Mgr is the grain mass, Ugr is the 
grain speed relative to the gas, and Tgr is the time for the grain to collide 
with its own mass in gas particles. For the case of an uncharged spherical 
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Fig. 30. Charge distribution functions for carbonaceous grains and silicate grains 
in “cold neutral medium” diffuse HI. 
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grain in neutral gas, the problem can be solved exactly, and approximated to 
within 1% by 
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where the sum is over the species (H, H 2 , He) in the gas (Draine & Salpeter 
1979a). If the grain is charged and ions are present, there is an additional 
“Coulomb drag” term (Draine & Salpeter 1979a), but this is numerically 
unimportant in regions of fractional ionization ^ 10“^. Equation (69) is ob- 
tained for either perfectly inelastic collision (where the gas atom is assumed 
to “stick” ) to the grain or perfectly elastic specular reflections. For subsonic 
motion, the drag time is 
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For a cool region where the hydrogen is primarily molecular. 
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5.2 Lorentz Force 



Dust grains are charged, and there are magnetic fields in clouds. The cy- 
clotron period is 
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where we take {Z) because the grain charge fluctuates on a time short com- 
pared to the cyclotron period. Values of (Z) are indicated in Figs. 29 and 30. 
The character of the grain dynamics depends on the ratio of the drag force 
to the Lorentz force, or 
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where we have assumed the hydrogen to be atomic. For grains in cold clouds, 
we see that we usually have WcTdrag ^ 1: grains are strongly coupled to the 
magnetic field, except for grain sizes where collisional charging and photo- 
electric charging happen to just balance so that \{Z) \ ^ 0.02(a/10“® cm)^. 



5.3 Radiation Pressure 

The interstellar radiation field is in general anisotropic. As a result, dust 
grains are subject to “radiation pressure” forces which can cause drift of 
the grain relative to the gas. This drift can be important as a means of 
transporting dust through gas, but also because the drifting grains transfer 
momentum to the magnetic field (via the Lorentz force) and to the gas (via 
gas drag). 

Consider a unidirectional beam of photons of energy hv, with energy 
density u,ydv in interval v + dv). The photons have momentum hvjc, and 
this momentum can be transferred to the grain by absorption or by scattering. 
It is easy to see that the rate of momentum transfer is 

Frad = ^ [^abs + Csca(l ~ (cOS ^))] ^ (77) 

— 7T(l [ Hy dv Qpr{^^ 5 Qpr(^) = Qabs Qsca(l (cOS ^)) . (78) 



where Qabs = Cabs/Tra^, Qsca = Cgca/Tra^, and (cosO) is the mean of the 
cosine of the scattering angle. 

We are generally interested in values of Qabs and Qpr averaged over the 
spectrum of the radiation. 

We are often content to approximate the radiation from a star by a 
blackbody spectrum. In neutral regions of the interstellar medium we are 
frequently interested in averages over a blackbody spectrum with a cutoff at 
13.6 eV, and it is useful to calculate spectrum-averaged values 



_ r--dvB,{T)Q{v) 



(79) 



where B^{T) is the Planck function (54) and = 13.6 eV. 

We will also be interested in Q values averaged over the interstellar ra- 
diation field, for which we use the estimate of Mathis, Mezger, & Panagia 
(1983). 
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Suppose that at some point in the interstellar medium the starlight has a 
net energy fiux cAurad- By summing over the Skymap Star Catalog (Slater & 
Hashmall 1992), the Tycho Catalog (ESA 1997), and the all-sky ultraviolet 
observations of the S2/68 experiment on the TD-IA satellite, Weingartner & 
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Fig. 31. Radiation pressure efficiency factor (Qpr) for neutral carbonaceous and 
silicate grains, averaged over the interstellar radiation field (ISRF) and blackbody 
spectra with indicated color temperatures, cut off at 13.6 eV. Taken from Wein- 
gartner & Draine (2001b). 



Draine (2001b) estimated the anisotropy in the starlight background at V, 
B, and 4 ultraviolet bands. In Fig. 32 we show the net dipole moment of the 
radiation field at these 6 wavelengths. 

5.4 Recoil from Photoelectric Emission and Photo-desorption 

Radiation pressure is, of course, just the transfer of photon momentum to 
the grain. However, radiation can serve to energize other processes which can 
exert a thrust on the grain: photoelectric emission and photo-desorption. To 
see that these might be important, it is sufficient to compare the momentum 
of an hv « 10 eV photon with the momentum of the photo-electron it might 
produce, if the photo-electron kinetic energy = / x /uz: 

p(electron) {2rriefhvY^‘^ - 320^1/2 /lOeVy/^ 

p(photon) hvjc \ hv ) 



(81) 
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Fig. 32. Dipole fraction for the interstellar radiation field. Upper curve is the ratio 
of the measured dipole moment to the measured starlight (the measurements are 
incomplete). Lower curve is the ratio of the measured dipole moment to the total 
estimated ISRF. The true fraction is probably between these two curves. After 
Weingartner & Draine (2001b). 



Similarly, suppose that a photon caused an H 2 molecule to be desorbed with 
kinetic energy / x hv. 



p(photon) hvjc ’ \ hv ) 



(82) 



Therefore if an appreciable fraction of absorbed photons produce photoelec- 
trons or photodesorb atoms or molecules, there could be a significant recoil 
on the grain. 

For a spherical grain in an isotropic radiation field, the photoelectrons 
would be emitted isotropically, with zero net thrust. Suppose, however, that 
we have a unidirectional radiation field (e.g., light from a single star). In 
this case we might expect a higher rate of photo-electron emission from the 
“bright side” than from the “dark side” of the grain. However, very small 
grains do not have a “dark side” - they are effectively transparent to the 
incident radiation, so one must try to estimate how this depends on grain 
size. How can we estimate the bright side/dark side asymmetry? A simple 
model is to assume that the rate of photoelectric emission from a point on the 
grain surface is proportional to |Ep just inside the surface. Using Mie theory 
to calculate the electric field intensity E in a spherical grain, one can then 
evaluate the relative rates of photoelectric emission over the grain surface. 
Using this approach, Weingartner & Draine (2001b) have investigated the 
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net thrust due to escaping photoelectrons - equal and opposite to the net 
momentum which the escaping electrons have when they reach infinity. The 
photoelectric yield, and the energy of the photoelectrons, is of course affected 
by the charge on the grain, and therefore depends on the environmental 
conditions. The analysis is complicated by the fact that if the grain is charged, 
photoelectrons with nonzero angular momentum will escape to infinity in 
directions different from the direction they are travelling when they emerge 
from the grain surface (until it reaches infinity, the electric field of the photo- 
electron exerts a force on the grain). 

Figure 33 shows the results of Weingartner & Draine (2001b) for the 
ratio of the photoelectric thrust to the radiation pressure force, as a function 
of grain size. We see that this is an order-unity correction for grains with 
a « lOOA in the CNM. 

For photoelectric emission, it is reasonable to assume that the grain is 
“spherically symmetric” prior to arrival of the photon - there are just as many 
electrons available for photo-ejection on the bright side as on the dark side of 
the grain. For photo-desorption, however, this is not a reasonable assumption. 
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Fig. 33. Photoelectric thrust and photo-desorption thrust relative to radiation pres- 
sure for carbonaceous and silicate grains as a function of grain radius a. Curve 
labelled “CNM” is for cold diffuse clouds with Ue = 0.03 cm“®, T = 100 K, and 
the standard ISRF. Note that the photo-desorption force can be negative for grains 
with radii lOOA ^ a ^ 500A. From Weingartner & Draine (2001b). 
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As we will see in §6 below, grains are expected to be spinning rapidly, with 
their principal axis of largest moment of inertia expected to be aligned with 
the angular momentum. Because of this gyroscopic stabilization, one may 
expect that if photo-desorption is a rapid process, then the “bright” side of 
the grain will have fewer adsorbed molecules ready to be photo-desorbed, an 
effect which will depend on the orientation of the grain’s spin axis with the 
direction of radiation anisotropy. 

If photo-desorption is rapid, the thrust will be limited by the need to 
resupply the grain surface with new molecules to be photo-desorbed. The 
story has a number of additional complications: 

• The result will depend on the wavelength-dependence of the photo-des- 
orption cross sections for the adsorbed species. This is essentially un- 
known. Weingartner & Draine simply assume that the photo-desorption 
rate for a molecule on the grain surface is proportional to |Ep just outside 
the grain surface, integrated from 6 — 13.6 eV. 

• Adsorbed species may diffuse over the grain surface, so that the “bright 
side” could be replenished from the “dark side”. 

• If photo-desorption is rapid, the surface coverage of adsorbates will de- 
pend on the orientation of the grain spin axis relative to the direction of 
starlight anisotropy. 

• The spinning grain can undergo “polar flips” ~ the “thermal flipping” 
process described by Lazarian & Draine (1999a). 

• If the grain is moving, the grain motion will alter the rates at which 
molecules are resupplied to the different parts of the grain surface: the 
“bright side” may suffer a reduction in the rate of arrival of atoms or 
molecules from the gas. 

• H atoms which are not photo-desorbed might react on the grain surface to 
form H 2 , and could recoil from the grain with substantial kinetic energy. 
This process might happen more rapidly on the dark side where the H 
concentration is higher, thus acting to oppose the thrust due to photo- 
desorption. 

With so many uncertainties, it is not possible to reach definite quantitative 
estimates for the photo-desorption force, but Weingartner & Draine (2001b) 
evaluate one plausible set of assumptions; results are shown in Fig. 33 for 
grains in the “warm neutral medium” and the “cold neutral medium”. It is 
curious that there is a range of grain sizes (100— 400A for carbonaceous grains, 
100 — lOOOA for silicate grains) where they find that the photo-desorption 
force is negative - this occurs because for this range of sizes, interference 
effects cause |Ep to be larger on what would have been expected to be the 
dark side of the grain if the grain were opaque and geometric optics were ap- 
plicable. Note that the larger grains do behave as “expected” for macroscopic 
targets. 

The “bottom line” is that photo-desorption can make a significant con- 
tribution to the net force on the grain, which (coincidentally) is expected to 
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be comparable in magnitude to the radiation pressure force and the photo- 
electric thrust, but which can in principle be of the opposite sign, depending 
on detailed physics of photo-desorption which are not known at this time. 

5.5 Drift Velocities for Interstellar Grains 

For the special case where the radiation anisotropy is parallel to the magnetic 
field direction, we can obtain the steady-state grain drift velocity by balancing 
the drag force against the radiation-related forces: 



.^drag — .^rad + Fpe + F'pd , (83) 

where Fpg and Fp^ are the forces due to photoelectric emission and photo- 
desorption. If we assume subsonic motion, and neglect Coulomb drag, this 
can be written 

'Cdrift \ / \ 1 F Fp^j Fj-ad F I^pd/.^rad 

^kT/m-R \riYikT ) 

so we see that the grain will drift with a “Mach number” which is approxi- 
mately equal to (Qpr) times the ratio of the anisotropic radiation pressure to 
the gas pressure. Since Qp^ reaches values of order unity (see Fig. 31), we may 
expect peak Mach numbers or order 0.2 or so - the ratio of the anisotropic 
radiation energy density to the gas pressure in the diffuse ISM. 

In Fig. 34 we show drift speeds calculated by Weingartner & Draine for 
carbonaceous and silicate grains in the WNM and CNM. The drift speeds 
are clearly a strong function of grain size, and appear to be negligible for 
small a lOOA grains. For larger grains the drift velocities are modest but 
potentially significant since they could be sustained for long times: in the 
CNM, a radiation anisotropy of only Au^ad/urad = 0.1 would give a 0.3 /rm 
carbonaceous grain a drift speed of ~ 0.04kms“^, so in lOMyr it could drift 
0.4 pc, and in the WNM the drift speeds are about an order of magnitude 
larger. This could in principle lead to removal of the large grains from some 
gas elements and concentration in others. 

6 Rotational Dynamics of Interstellar Dust 

The rotational dynamics of interstellar grains constitutes a fascinating story, 
not yet fully told. It is intimately tied to the long-standing problem of inter- 
stellar grain alignment, which we will discuss in §7. 

6.1 Brownian Rotation? 

Early discussions of the rotational dynamics of interstellar grains generally 
assumed the grains to be rigid spheres, which appeared to capture the essen- 
tial physics, and discussed the scattering of impinging atoms from the grain 
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Fig. 34. Drift velocities for carbonaceous and silicate grains, as a function of grain 
size, in the CNM and WNM, for a radiation anisotropy Ziurad/uiSRP = 0.1. Taken 
from Weingartner & Draine (2001b). The minimum in drift velocity for a ~ 250A 
carbonaceous grains or a « 350A silicate grains in the CNM is due to the reversal 
of the photo-desorption force for grains in this size range (see Fig. 33). 
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surface. As far as grain rotation was concerned, it seemed obvious that the 
grain would simply undergo a form of Brownian motion, exchanging linear 
and angular momentum with the gas. Once statistical equilibrium was at- 
tained, a rigid grain (with purely elastic scattering of atoms from its surface) 
would have an expectation value of /cTgas/2 for the kinetic energy in each 
of its degrees of freedom: three translational and three rotational. Thus the 
r.m.s. translational velocity of the grain would be 




which is negligible under all foreseeable circumstances. The r.m.s. rotation 
rate 
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is more impressive, though it simply corresponds to an equatorial velocity 
equal to the r.m.s. translational velocity (86) times a factor 1^5/2 = 1.58. 



6.2 Suprathermal Rotation 

Following the discovery of grain alignment in 1949, theoretical studies of the 
rotational dynamics were carried out to try to understand the alignment 
process, including the seminal paper by Davis & Greenstein (1951). It was 
recognized that, since interstellar grains were likely to have vibrational tem- 
peratures differing from the gas temperature. Equation (88) needed to be 
modified, and it was understood that for a spherical grain the temperature T 
appearing in Eq. (88) should be some weighted mean of the gas temperature 
and the grain temperature, since collisions of atoms with the grain surface 
will not be perfectly elastic. 

However, in 1975 - 24 years after Davis & Greenstein’s study - Purcell 
(1975) realized that interstellar grains could act as “pinwheels”. After all, a 
simple pinwheel weighting many grams can readily be put into rotation with 
a rotational kinetic energy many orders of magnitude greater than kT by the 
air molecules which excite it. Purcell (1979) showed, in fact, that there were 
3 distinct physical processes acting on interstellar grains which could each, 
acting alone, cause a grain to rotate with a rotational energy (l/2)/w^ kT: 

1. The effective “accommodation coefficient” will probably vary over the 
grain surface, due to chemical inhomogeneities and/or geometrical irreg- 
ularities. Such variations in accommodation coefficient on a non-spherical 
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grain will lead to a systematic torque (i.e., one which whose time-average 
is not expected to be zero) if the gas temperature and grain temperature 
differ. 

2. Photoelectrons ejected from the grain may, on average, carry away angu- 
lar momentum, resulting in a systematic torque. 

3. H2 formation on the grain surface, with the nascent H2 molecules ejected 
with a significant kinetic energy, is expected to result in a large system- 
atic torque, particularly if the H2 formation occurs at a relatively small 
number of “active sites” on the grain surface. 

Purcell showed that of the three processes, the H2 formation process was 
likely to be the most important in regions where the H is atomic. 

6.2.1 Suprathermal Rotation Driven by H 2 Formation 

It is instructive to examine the argument for suprathermal rotation driven 
by H2 formation at active sites on the grain surface. 

To simplify the argument, let us consider a region where only atomic H is 
present (no He), and consider a cube of sides 26 x 26 x 26, with volume 86^, 
surface area 246^, and effective radius Oeff = (6/7 t )^/^6 = 1.246. 

Choose a site at random on the grain surface, and assume that H2 molecules 
are formed at that site at a rate iVi . For simplicity, suppose that the newly- 
formed H2 molecules leave normal to the surface with kinetic energy 
The expectation value for the square of the moment arm is then just (2/3)6^, 
and the expectation value for the mean square torque exerted by this site is 

rf = iVf [mH^WHs]^ X (89) 

= . (90) 

Now suppose that there are a total of A^site such sites on the grain surface, 
and suppose that a fraction 7 of the H atoms which impinge on the surface 
are converted to H2 (so there are 7/2 H2 formation events per arriving H 
atom). In a thermal gas, the rate per area at which atoms collide with a 
convex surface is just n{kT j2nmY^'^ , so 
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(91) 



Since the sites are assumed to be randomly located, the torques add like a 
random walk, so that the expectation value for the square of the total torque 
is just 
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The rotating grain will experience gas drag. For slow rotation rates w, it is 
not difficult to show that the drag torque is 



r 

r drag — g 

To find (w^), we now set - 

{iO^) = 



= (-^drag) to obtain 
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or a rotational kinetic energy, relative to the thermal value, 
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where M = 8b^p is the grain mass. Now this is no small number! The number 
of active sites is uncertain. If we suppose that there is a surface area per 
active site, then NgUe = ‘IW? j , and 
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so that we might have MHz rotational frequencies for 6 < 3 x 10“® cm! It is 
apparent that systematic torques can play a major role in grain dynamics. 

Each H 2 formation site can be thought of as being like a small rocket 
thruster attached to the grain surface: the systematic torque due to H 2 for- 
mation is fixed in body coordinates, so long as the H 2 formation sites do not 
change. The kinetics of H 2 formation on grain surfaces is poorly understood 
at this time, and it is not certain how long-lived the active sites are likely to 
be. 



6.2.2 Radiative Torques Due to Starlight 

The three torques identified by Purcell do indeed appear to be individually 
capable of driving grains to suprathermal rotation. In fact, there appears to 
be an additional physical process which can compete with the H 2 formation 
torque for grains larger than ~ 10“® cm: torques exerted on interstellar grains 
by starlight. 




Astrophysics of Dust in Cold Clouds 271 



We discussed above the “radiation pressure” force which anisotropic star- 
light could exert on a grain. It turns out that starlight can also exert a torque 
on grains, and these torques can be dynamically important! 

How can we address this problem? Analytic progress is difficult, because 
the only cases for which we have analytic solutions are cases where radia- 
tive torques vanish: (1) grains of complex shape but small compared to the 
wavelength; (2) spheres; (3) spheroids. In the first case the torques vanish 
because only the dipole response of the grain is important, and the dipole 
radiation does not carry angular angular momentum. In the case of spheres 
and spheroids, there is no torque because of the symmetry of the target. 

In order to obtain a nonzero torque we must have an asymmetric target. 
Draine & Weingartner (1996) discussed use of the discrete dipole approxi- 
mation to calculate radiative torques. The DDSCAT code (Draine & Flatau 
2000) now provides the capability to compute torques. For an asymmetric 
target illuminated by a beam of unpolarized radiation, the direction and 
magnitude of the torque depends on the orientation of the target relative to 
the beam. It is useful to define a “torque efficiency vector” Qp defined so 
that the torque on the grain is given by 



Frad — ^^efF^radr, Qf ? (1!^!^) 

Z7T 

where the effective radius Oeff is the radius of an equal volume sphere. We 
will generally want to average Qr over some spectrum; Draine & Weingartner 
(1996) calculate 
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Fig. 35. Macroscopic examples of targets which would be subject to radiative 
torques. From Draine & Weingartner (1996). 
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for u\ taken to be the MMP ISRF (for which A = 1.202 //m). With these 
definitions, the torque 
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We recall from classical mechanics that an arbitrary object has three 
principal axes, ai, & 2 , & 3 , and the moment of inertia tensor is diagonal in a 
coordinate system aligned with these three axes. Let the eigenvalues of the 
moment of inertia tensor be Ii > I 2 ^ Is- 

For purposes of discussion, let us assume that Ii > I 2 = h- If the grain 
has angular momentum J, then the rotational kinetic energy is 
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with the rotational kinetic energy minimized for /3 = 0. As we will see below, 
when J is large we expect the grain to remain close to this minimum energy 
state, with sin^ /3 <C 1. 

The grain spins rapidly, so we are concerned with the torque after aver- 
aging over rotations of the grain around the axis ai. This rotation-averaged 
torque therefore depends only on a single angle - the angle 0 between ai 
and the direction of the incident radiation. 

We will see below that we expect a spinning dust grain to process around 
the direction of the local magnetic field B , so what we really need is the torque 
averaged over grain rotations and over precession of the axis ai around the 
magnetic field direction. For a given grain and spectrum of radiation, this 
will now depend on two angles: the angle ip between B and the direction of 
the radiation, and the angle ^ between ai and B. 

For purposes of spinup (or spindown), we are interested in the projection 
of the torque efficiency vector along the spin axis, i.e., the axis of largest 
moment of inertia: Qr(0)-aj^. Draine & Weingartner (1996, 1997) define 
= (Qr • ai) averaged over rotation around ai and precession of ai 
around B. They calculate H(^,(j)) for three grain shapes with effective radii 
Oeff = 0.2 /im. The numerical values of H of course depend on ^ and 4>, but a 
typical value of \H\ would be \H\ « 0.005 (see Fig. 5 of Draine & Weingartner 
1997), so that the spinup torque would be 
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It is of interest to compare this to the spinup torque due to H2 forma- 
tion. We can rewrite Eq. (93) replacing b by (7r/6)^/^aeff, and project onto a 
random direction ai to obtain 

((r„.a,)=)>/»=l(A) (t) T(l)f) . a(H)tT , (107) 

Thus the ratio of the radiative spinup torque (106) to the r.m.s. value of the 
H2 formation torque is 
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Adopting the 10% anisotropy estimated by Weingartner & Draine (2001b), we 
have Awrad ~ 8.64 x 10“^^ergcm“^. With n(H) = 30cm“^, and T = 100 K, 
we have Auys^^/n{Il)kT = 0.21. Therefore with an H2 formation efficiency 
7 < 0.5 the radiative torque would exceed the H2 formation torque if the 
other parameters have nominal values (in particular, s = lOA). Many of the 
numbers are uncertain; the important point is that for Oefi = 0.2 /im grains 
the radiative torque has a magnitude which is comparable to our estimate 
for the H2 formation torque. 



7 Alignment of Interstellar Dust 

Polarization of starlight was discovered in 1949. It was immediately deduced 
that the only plausible mechanism was “linear dichroism” of the interstellar 
medium due to aligned dust grains, but understanding the mechanism re- 
sponsible for this alignment has frustrated astrophysicists for half a century. 
In fact, the story of the efforts of astrophysicists to understand grain align- 
ment is humbling. It is not just because it has taken us so long to solve the 
problem - the humbling lesson is that several very important and relatively 
simple physical effects were overlooked for decades. It was not that these 
physical effects were difficult to understand, or that they involved physics 
which has only recently been understood. On the contrary - the physics in- 
volved would have been familiar to well-trained physicist in 1950. It was 
simply that important processes were overlooked: 

• The Rowland effect: a charged, spinning dust grain will develop a mag- 
netic moment due to its circulating charge (Martin 1971). 

• The Barnett effect: a spinning dust grain with unpaired electron spins 
will spontaneously magnetize (Dolginov & Mytrophanov 1976). 

• Suprathermal rotation due to dust-gas temperature differences (Purcell 
1975, 1979). 
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Fig. 36. Linear polarization of light from 1800 stars. From Mathewson & Ford 
1970. 



• Suprathermal rotation due to photoelectric emission (Purcell 1975, 1979). 

• Suprathermal rotation due to H 2 formation (Purcell 1975, 1979). 

• Viscoelastic dissipation of rotational kinetic energy due to time-varying 
stresses in a grain which is not rotating around a principal axis (Purcell 
1979) 

• “Barnett dissipation” of rotational kinetic energy due to the electron spin 
system (Purcell 1979). 

• Dissipation of rotational kinetic energy due to the nuclear spin system 
(Purcell 1979). 

• Suprathermal rotation due to starlight torques (Draine & Weingartner 
1996). 

• Fluctuation phenomena associated with Barnett dissipation (Lazarian & 
Roberge 1997; Lazarian & Draine 1999a) and coupling to the nuclear 
spins (Lazarian & Draine 1999b). 

Have we thought of all the important processes? Or are there other important 
phenomena to which our eyes are still closed? 

7.1 The Motion of Spinning Grains 

As seen above, dust grains spin very rapidly. Consider for the moment one 
spinning grain. If the angular momentum J does not happen to be along one 
of the grain’s principal axes, then the grain’s angular velocity uj will not be 
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parallel to J, and will nutate around J, with a nutation rate that will be 
smaller than (but comparable to) the rotation rate. 

Now imagine observing a single grain as it spins at kHz rotation frequen- 
cies. Because the rotation and nutation are both rapid, we will quickly obtain 
a rotation-nutation average of the grain orientation. During this observation, 
the angular momentum J is conserved, and defines a preferred direction. 
Now consider light propagating in some particular direction. Because of the 
rotation-nutation averaging, the grain will tend to polarize the light with 
electric vector either exactly perpendicular to J, or exactly parallel to J, de- 
pending on whether the grain on average has its “long” axis perpendicular 
to J, or parallel to J. 

This is for one grain. If a population of grains shows a tendency to po- 
larize light, then the individual grain angular momenta cannot be randomly- 
oriented - there must be a tendency for the angular momenta J to be aligned 
with some preferred direction in space. 

The problem of grain alignment is to account for the alignment of grain 
angular momenta J. 

Many alignment mechanisms have been proposed over the years. Some 
directly involve the galactic magnetic field (Davis & Greenstein 1951). Other 
proposed mechanisms rely on so-called “mechanical alignment” due to gas 
(Gold 1952) or cosmic rays (Salpeter & Wickramasinghe 1969) streaming 
relative to the grains. Harwit (1970) suggested that grain angular momenta 
might be aligned as a result of the angular momentum absorbed from starlight 
photons, if the starlight is anisotropic. It was only in 1971 that Martin (1971) 
pointed out that even in the case of nonmagnetic alignment processes, the 
observed alignment would be either exactly parallel or exactly perpendicular 
to the magnetic field, simply as a result of the magnetic moment on a spinning 
grain. 



7.2 Grain Magnetic Moments and Precession Around B 

Grain in diffuse regions tend to be electrically charged (see Fig. 30). If 
charged, a spinning dust grain will develop a magnetic moment due to the 
“Rowland effect”: the rotating charge on the grain constitutes a current circu- 
lating around a loop, and there will be a resulting magnetic field and magnetic 
moment interior to the loop. Recall that a current I flowing in a loop with 
area A generates a magnetic moment m = lA/c. The mean of over 

the surface of a sphere is just (2/3)a^, so the magnetic moment due to charge 
Ze distributed uniformly over the surface of a spinning sphere is 
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If oj is at an angle 0 > 0 relative to B, the spinning sphere will precess with 
a precession frequency 
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We have seen above (Fig. 30) that in a CNM environment, we might expect 
an a = 10“® cm grain to have {Z) « 10, for a precession period ~ 6000 yr. 
This time is short compared to the other dynamical times: the gas drag time 
(72), or (as we shall see below) the time for grain alignment to occur. 

We can therefore conclude that regardless of what process is responsi- 
ble for alignment of interstellar grains, each grain will have a time-averaged 
angular momentum distribution which must be symmetric around B. There- 
fore the interstellar dust mixture will tend to polarize starlight either exactly 
parallel to B or exactly perpendicular to B, for any conceivable alignment 
mechanism! 



7.3 Barnett Effect 

Dolginov & Mytrophanov (1976) pointed out that the “Barnett effect” would 
result in grain magnetic moments much larger than due to the Rowland effect. 
The Barnett effect can be stated very simply: an uncharged object rotating 
with angular velocity ui tends to spontaneously magnetize, with a magneti- 
zation M = x^Hi where x is the susceptibility, and 7 is the “gyromagnetic 
ratio” for the material - the ratio of (magnetic moment)/(angular momen- 
tum) for the electron orbitals or electron spins responsible for the magneti- 
zability of the material. For electron spins, 7 = /is/(?i/2) = e/2meC, where 
Pb = eTil2meC is the Bohr magneton. 

The spontaneous magnetization is equal to that which would be produced 
in a stationary sample in a (fictitious) applied magnetic field Hbb = ojjx = 
2meCw/e - the so-called “Barnett equivalent field”. 

At first sight the Barnett effect seems surprising, but it is not difficult to 
understand. Consider an unmagnetized sample (equal numbers of “spin-up” 
and “spin-down” electrons), with angular momentum J, spinning at angular 
velocity uj. At constant total angular momentum, the angular velocity of the 
sample can be reduced, with consequent reduction of the rotational kinetic 
energy Ia;^/ 2 , if some of the angular momentum is put into the electron spin 
system. The electrons have magnetic moments, so with unequal numbers of 
electrons in the two spin states, the sample is now magnetized. The reduction 
in rotational kinetic energy is accompanied by heating of the grain, with an 
increase in entropy. The effect was observed in the laboratory by Barnett 
(1915). 




Astrophysics of Dust in Cold Clouds 277 



For normal paramagnetic materials, the magnetic susceptibility is just 
X = nsH%/kTg, where rtg is the number density of unpaired electrons, and Tg 
is the grain temperature. The magnetic moment due to the Barnett effect is 
thus 
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An unpaired electron density Ug « 10^^ cm“^ is typical for normal para- 
magnetic materials, so it is clear that the Barnett effect is very important 
for interstellar grains, and will result in precession periods of order days or 
months rather than the thousands of years for the Rowland effect. 



7.4 Alignment by Paramagnetic Dissipation: 

Davis-Greenstein Mechanism 

Davis and Greenstein (1951) described a mechanism which could produce 
grain alignment. Consider a grain spinning with angular velocity w in a static 
magnetic field Bq. Let 9 be the angle between Bq and oj. 

The grain material has a magnetic susceptibility y, and it will try to 
magnetize in response to an applied magnetic field. In grain body coordinates, 
the applied field has a static component parallel to the grain rotation axis, 
with magnitude Ry = Bq cos 9, and a component of magnitude Rj_ = Bq sin 9 
which appears to be rotating, with angular velocity ui relative to the grain. 
The rotating field Bj_ is perpendicular to u. 

It seems clear that the grain material will develop a magnetization M 
which will have a static component My = yo^|| parallel to oj, and a com- 
ponent Mj_ which rotates in body coordinates (but is stationary in inertial 
coordinates), with magnitude Mj_ = |y(w)|Rj_, where y('^) is the (complex) 
magnetic susceptibility for a magnetic field rotating at frequency to. In body 
coordinates, the rotating magnetization Mj^ will “lag” behind the rotating 
field Bj_ - there will be an “in-phase” component xi(w)Rj_ and an “out-of- 
phase” component X 2 (u’)R_l- 

The galactic magnetic field Bq will exert a torque on the grain magnetic 
moment RM. The torque is obviously T to Bq. A bit of calculation shows 
that the out-of-phase component of the magnetization results in a torque of 
magnitude Vx 2 B±Bq = Vx2BqS\xi9 which acts to leave the component of 
the angular momentum || to Bq unchanged (the torque, after all, must be 
T Bq) but acts to reduce the component Jj_ = Jsin6* perpendicular to Bq: 

^J_L = -Vx2Bls\n9 
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At low frequencies the imaginary component of the magnetic susceptibility 
varies linearly with frequency: \2 = Kui. With u> = J / 1 = J||//cos0, noting 
that J|| = const, we obtain 



with solution 
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The magnetic susceptibilities of paramagnetic materials are quite well under- 
stood at the frequencies of interest (see, e.g., Morrish 1965), with 
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Thermally-rotating grains are disaligned by random collisions with gas atoms 
on a time scale Tdrag- The ratio of the alignment time to the disalignment 
time is 




where we have assumed the hydrogen to be atomic. Thus we see that even for 
a = 10“® cm grains and a relatively high value of the magnetic field strength, 
alignment is slow compared to disalignment, and the ratio is linear in the 
grain radius a. The Davis-Greenstein alignment mechanism favors alignment 
of smaller grains, but this is a problem, since small grains are observed to 
NOT be aligned, while large grains are (Kim & Martin 1995). As we will see 
below, some important physics has been overlooked. 



7.5 Effect of Suprathermal Rotation 

Gonsider now a grain rotating suprathermally. The Davis-Greenstein align- 
ment time is independent of the rotation rate, since the torque is propor- 
tional to the rate of rotation. On the other hand, the time scale for random 
collisions to change the direction of the angular momentum is approximately 
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(•^/'Ah)^7drag; where Jth is the angular momentum expected for Brownian ro- 
tation. So if the grain rotation is suprathermal, with J/ Jth ^ we can virtu- 
ally ignore the disaligning effects of random collisions, and Davis-Greenstein 
paramagnetic dissipation will bring the grain into alignment with Bq in a few 
million years. As we have seen in §6.2, suprathermal rotation appears to be 
expected. Could it for some reason be suppressed in small grains? 

7.6 Thermal Flipping vs. Suprathermal Rotation 

Why aren’t small grains aligned? From Eq. (98) it certainly appears that H 2 
formation should be able to drive grains as small as 10“®cm into suprather- 
mal rotation, and yet there is strong evidence that grains smaller than 10“® cm 
are not aligned (Kim & Martin 1995). 

Lazarian & Draine (1999a) have proposed an explanation for this - the 
process of “thermal flipping” may prevent small grains from achieving supra- 
thermal rotation. 

The key issue is that the grain does NOT behave like a “rigid body” 
subject only to external torques - the grain has internal vibrational degrees of 
freedom. The grain rotational kinetic energy depends on the angle (3 between 
principal axis ai and J through Eq. (105): 

i?rot = ;^ + j"%^sin2/3 . (124) 

I1I2 

In the absence of external torques (and ignoring the small amount of angular 
momentum in the spin system), J is fixed, but the rotational kinetic energy 
can change if energy is exchanged with the vibrational degrees of freedom of 
the grain. Thus we expect the angle /3 to fluctuate. Suppose, for example, 
that we start at /? = 0. Thermal fluctuations will cause f3 to “explore” the 
region 0 < /? < tt/ 2. If 

< kT , (125) 

hh 

then thermal fluctuations could allow j3 to reach tt/ 2: the maximum possi- 
ble rotational kinetic energy. At this point (3 can go either way, with equal 
probabilities, so the grain may “flip over” to the state 7 t/2 < f3 < tt. 

Now consider the effects of H 2 formation torques. These torques are fixed 
in body coordinates. If the grain “flips” from /3 = 0 to /3 = tt, the torques 
change sign in inertial coordinates! As a result, if the grain flips frequently, 
the H 2 torques can be averaged out to zero, and have no significant effect on 
the grain dynamics, so that the grain rotation remains essentially thermal! 
Lazarian & Draine (1999a) refer to this as “thermal trapping”. 

The question now comes down to estimating the rate of flipping, and 
how the rate depends on grain size. Lazarian & Draine (1999a) discussed 
the role which the Barnett effect (i.e., the electron spin system) can play 
in coupling the grain rotation to the vibrational modes, and concluded that 
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flipping would be rapid enough to result in thermal trapping for grain sizes 
a^O.l/im. This seemed to successfully account for the absence of grain 
alignment for a ^ 0.1 /rm. 

However, further analysis of grain dissipational processes showed that 
the nuclear spin system could be important at the lower rotational rates 
appropriate to larger grains (Lazarian & Draine 1999b). It was found that 
the coupling of grain rotation to the nuclear spin system would lead to rapid 
thermal flipping even for grains as large as ~ 1 ^m! 

Thermal flipping therefore now appears able to prevent H 2 formation 
torques from being able to achieve superthermal rotation even for the a « 
0.2 fxm grains which are observed to be aligned! How is grain alignment to be 
achieved? 

7.7 Effect of Radiative Torques 

So long as the grain surface is not modified, the H 2 formation torques are fixed 
in body coordinates, and thermal flipping will cause these torques to average 
to zero if the grain spends equal amounts of time in the two “flip states” 
(i.e., P < 7t/2 and P > tt/ 2). However, the grain is also subject to other 
systematic torques, the most important of which is the radiative torque due 
to anisotropic starlight. This torque is not fixed in body coordinates, so that 
even if the grain spends equal amounts of time in the two flip states, there is a 
secular change in angular momentum due to the radiative torques. Radiative 
torques are unimportant for small a 0.05 /xm grains, but very important for 
a ^ 0.1 /im grains. It therefore appears that “thermal trapping” will prevent 
small (a iS 0.05 /xm) grains from achieving suprathermal rotation, but that 
larger (a 0.1 /xm) grains can achieve suprathermal rotation, following which 
they can be gradually aligned by the Davis-Greenstein mechanism. 

But life turns out to be more complicated. Draine & Weingartner (1997) 
found that radiative torques act not only to spin up grains, but also to change 
their alignment. The characteristic time scale for radiative torques to change 
the grain alignment turns out to be just the drag time r^rag, which is shorter 
than the Davis-Greenstein alignment time, so that the radiative torques ap- 
pear able to dominate over the systematic effects of the Davis-Greenstein 
alignment torque. 

Study of three different grain shapes and different angles between the 
magnetic held direction and the starlight anisotropy direction showed that 
under some conditions the starlight torques brought about alignment (the 
aligned state was an “attractor”), while under other conditions the grain did 
not settle down in any stationary state. There appeared to be an overall 
tendency for grain alignment to take place, but the sample of 3 grain shapes 
was insufficient to reach general conclusions. 

Further study of this problem is underway, and it is hoped that in a year 
or two we will have definite conclusions on the role of radiative torques in 
the grain alignment problem. But for the moment the situation appears very 
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promising - we seem to finally understand the absence of alignment of small 
grains, and it appears that the physics we now understand will explain the 
alignment of the larger ones, with radiative torques due to starlight playing 
a critical role. 

7.8 Alignment and Disalignment in Different Regions 

There is evidence that the degree of grain alignment varies from one region to 
another. Observations of polarization of starlight indicate that the a 0.1 /rm 
grains in diffuse clouds are substantially aligned, but JHK observations of 
quiescent dark clouds (Goodman et al. 1995) indicate that the dust grains in 
the dark interiors are not aligned. How can we understand this? 

There are several possible plausible answers: 

• The inner dark regions have insufficient starlight for the radiative torques 
to achieve grain alignment. 

• The inner dark regions are molecular, so that H 2 formation torques can- 
not drive suprathermal rotation. 

• Photoelectric emission is suppressed in the inner dark regions. 

It is also possible, of course, that either the magnetic field structure is com- 
pletely tangled in the interior, or the grains in the central regions are for 
some reason more spherical, but these do not seem plausible. 

It is also interesting to note that in some dark clouds the grains are rea- 
sonably well aligned: for example, the M17 molecular cloud (Fig. 37) shows 
linear polarization (averaged over the 22" beam) as large as 5%. Note, though, 
that although this cloud is “dark” (i.e., we cannot see into it at optical wave- 
lengths) it is full of stars - the infrared spectrum indicates a grain tempera- 
ture T « 45 K, and the grains must be heated by starlight: the mean starlight 
intensity must be ~ (45/20)® « 10^ times greater than the ISRF in order to 
heat the grains to this temperature! It therefore appears likely that starlight 
torques could drive the grains in this cloud to suprathermal rotation rates. 

Finally, a recent study by Padoan et al. (2001) concluded that submm 
polarization maps of quiescent cores were consistent with a model where the 
grains in the outer parts of the cloud with Ay iS 3 mag were aligned, while 
grains in regions with Ay > 3 mag were randomly oriented. 

All of the above appear consistent with the idea that starlight torques 
are an essential part of the grain alignment process. Grains deep in quiescent 
dark clouds are unable to align because starlight torques are weak or absent, 
whereas starlight torques are present are do produce alignment within star- 
forming clouds. 

7.9 Summary of the Grain Alignment Story 

The grain alignment story has been long and complicated, with a number of 
false turns. The discussion above has skipped some of the important physics 
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Fig. 37. 60 /im linear polarization toward M17. Flux contours are at 20%, 
30%,. . . ,90% of the peak flux. From Dotson et al (2000). 



involving coupling of the grain rotation to the electron and nuclear spin 
systems (see Lazarian & Draine 1999a, b) as well as the important question 
of the grain dynamics during “crossovers”, when reversals of the systematic 
torques on the grain cause the grain angular momentum to become small 
- during this time the grain is very vulnerable to disalignment (see, e.g., 
Lazarian & Draine 1997). 

To summarize: the important elements of the grain alignment story now 
appear to be as follows: 

• Grains are subject to 2 important systematic torques: 

— H 2 formation torques, 

— radiative torques due to starlight. 

• For grains with rotational kinetic energies less than a few times fcTu, 
thermal fluctuations allow the grain to sample different orientations of 
the principal axes relative to the instantaneous angular momentum J, 
leading to “thermal flipping” of the grain between the two “flip states” . 

• If the thermal flipping is sufficiently rapid, the grain will spend 50% of 
the time in each flip state, in which case the H 2 formation torque, which 
is fixed in body coordinates, will integrate to zero in inertial coordinates: 
the H 2 formation torque will be unable to drive the grain to suprathermal 
rotation. 
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• Starlight torques are not fixed in body coordinates, and hence do not 
integrate to zero even if the grain spends equal amounts of time in the 
two flip states. 

• For grains with a 0.1 /im in regions with sufficiently strong starlight, 
the systematic torque due to starlight will be able to drive the grain to 
suprathermal rotation. 

• Once a grain achieves suprathermal rotation, thermal dipping will cease, 
the grain will settle into a state where the principal axis ai is either 
parallel or antiparallel to J, and all systematic torques will operate si- 
multaneously. 

• Grains in suprathermal rotation are essentially immune from disalign- 
ment by random collisions. Alignment will result from the combined ef- 
fects of Davis-Greenstein alignment and starlight torques. 

• Under some circumstances, starlight torques do not result in strong align- 
ment. However, numerical experiments suggest that, averaged over irreg- 
ular grain shapes, starlight torques lead to overall alignment. 

• The fact that starlight torques are important only for grains with 
a ^ 0.1 fj,m appears to account for the observation that a ^ 0.05 /xm grains 
in diffuse clouds are not aligned, while a 0.1 /xm grains appear to be 
well-aligned. 

8 Evolution of the Grain Population 

The interstellar grain population in a galaxy is the result of a rich and com- 
plicated mix of processes. 

We would like to know the composition of interstellar dust, how and where 
it forms, and the extent to which individual grains are either homogeneous or 
conglomerate. We would like to know what the size distribution of the differ- 
ent types of grains is, how and where the size distribution is fashioned, and 
the manner in which the size distribution varies from one region to another. 
If we understood the balance of processes acting in our part of the Milky 
Way, perhaps we could deduce how grains might differ in the inner or outer 
parts of the Milky Way, and in the LMG, SMG, and other galaxies. 

Unfortunately, the overall grain evolution problem seems to me to be 
beyond our grasp at this time - I don’t think we understand the parts well 
enough to justify trying to put them together as a whole. At this time, we 
have to try to clarify the individual elements of the story. 

8.1 Dust Formation in Stellar Outflows 

There is abundant evidence that dust grains form in some stellar outflows: we 
see infrared emission from dust around red giants, around carbon stars, and 
in planetary nebulae. In all these cases the dust condensed out of material 
which at an earlier time was completely gaseous. Observations show that 
different types of stellar outflows form different types of dust. 
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The dust around red giants with 0/C > 1 shows a 10 /rm emission feature 
which is due to the Si-0 stretching mode in amorphous silicate material; 
around some OH/IR stars the 10 /xm feature appears in absorption (e.g., 
Demyk et al. 2000), along with emission features revealing the presence of 
crystalline silicates. 

The dust around carbon stars, on the other hand, does not show the 10 /rm 
feature, which is consistent with the fact that we do not expect silicates to 
form in an atmosphere with 0/C < 1. Some carbon stars (e.g. IRC-l-10216) 
instead show a 11.3 /xm emission feature which is attributed to SiC (e.g., 
Blanco et al. 1998). 

It should also be mentioned that we find presolar grains in meteorites 
whose isotopic composition clearly indicates formation in outflows from 
evolved stars (Hoppe & Zinner 2000); these include nanodiamonds, SiC, 
graphite, and AI2O3. Some of these grains may have formed in supernova 
ejecta. 

So there is no question that stellar outflows contribute dust to the inter- 
stellar medium. However, this does not necessarily imply that the bulk of the 
dust in the interstellar medium was formed in stellar outflows. 

The physics and chemistry of dust formation in stellar outflows is com- 
plex: the material is generally far from thermodynamic equilibrium, and the 
outflows themselves are often (perhaps always?) hydrodynamically complex 
- neither steady nor spherically symmetric. We can not yet reliably describe 
the details of the grain formation process. 

8.2 Grain Growth in the ISM 
8.2.1 Accretion in the ISM 

Dust grains in the interstellar medium can grow, since atoms from the gas 
can stick to them. We can calculate the lifetime against accretion for an atom 
or ion in the gas phase: 

fdaTTa^^D{a) , (126) 

\TTmuJ J da 

where Ai is the mass number of the ion, Si is the sticking coefficient, and rid (a) 
is the number density of grains with radii less than a. The enhancement factor 
D{a) gives the collision rate relative to what it would be in the absence of 
electrostatic effects: 

D{a) = J2fiZd,a)B{Z^,a) , (127) 

Zd 

where 

I' exp [—Z^ZiC^/akT^ for Z^Z\ > 0 
(1-^) forZgZi<0 

/ ^2 2 \ 1/2 

^ + ( 2akT ) = 0 



R(Zd,o) 



(128) 
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Figure 38 shows the collisional enhancement factor D{a) calculated by Wein- 
gartner & Draine (1999) for carbonaceous and silicate grains in various en- 
vironments. Enhancement factors D ^ 10^ are found for a iS 10“^ cm grains 
in the CNM. As a result, an ion (e.g., Sill, Till, Fell) would have a lifetime 
against accretion of ~ 1 x lO^yr in the CNM if the grain size distribution 
includes the ultra-small grain population of Fig. 10. Therefore we expect that 
those metals which can “stick” to the small grains will be strongly depleted 
from the gas phase; presumably those elements which are not strongly de- 
pleted (e.g., Na, K, S) for some reason do not remain on the ultra-small grains 
after they collide. 



8.2.2 Mantle Formation 

As discussed in §1.8, grains in dark molecular clouds are apparently coated 
with ice “mantles”. The overall mantle composition is dominated by H 2 O ice. 
It is not known whether the H 2 O molecules arrived from the gas phase and 
then froze onto the grain, or whether the H 2 O formed by surface chemistry. 
In most clouds the gas phase H 2 O abundance is very low, and the observed 
mantles would not be able to form by accretion, suggesting that O and H 
atoms arriving at the grain react there to form H 2 O (Jones & Williams 1984). 




Fig. 38. Collision enhancement factor for positively charged ion (see text). From 
Weingartner & Draine (1999). 
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However, it has recently been proposed that the H 2 O could be produced in 
the gas phase during occasional passage of a shock wave, and then quickly 
accreted onto the grain surfaces (Bergin et al. 1999). Other molecules (e.g., 
CO) may be mixed with the H 2 O, or in some cases condensed separately, 
as the absorption profile in some cases requires that the CO not be mixed 
with polar molecules. This could result if the H 2 O was deposited while the 
grain was too warm to retain CO, with the CO later freezing out when the 
conditions changed and the grain became colder. 



8.2.3 Coagulation 

The grain population in normal diffuse clouds has a very large population 
of very small grains, required to explain the rapid rise in extinction at short 
wavelengths. In dense regions, however, the extinction at short wavelengths 
appears to be reduced. This can only happen if some of the small grains are 
either destroyed altogether, or simply coagulated onto big grains. The latter 
seems more likely. 

What is the time scale for such coagulation? This depends on the velocity 
differences between different grains. Grain-grain velocities can be the result 
of differential drift velocities (see §5.5), hydrodynamic turbulence (see, e.g., 
Draine 1985), or MHD turbulence (Lazarian & Yan 2002). 

Grains larger than ~ 10“^ cm contribute a geometric cross section per H 
27 « 0.5 X 10“^^ cm^. If the characteristic grain-grain velocity difference is 
Avd, then the grain-grain coagulation time would be 

r 1-1 /30cm“^\ /0.1kms“^\ 

tcos.g=[riYiSAvd\ =2xl0®yr( — — — jf — - — j , (129) 

so that grain coagulation will not be very important in diffuse clouds, but 
could proceed in dense regions. 

8.3 Photolysis 

When grains with ice mantles are returned to diffuse regions and exposed to 
ultraviolet radiation, it appears that most of the material is driven off, but 
the ultraviolet radiation photolyzes the ice to produce an organic refractory 
residue (e.g., Greenberg et al. 2000). This residue may be responsible for the 
interstellar 3.4 /rm “aliphatic C-H” absorption band. 

8.4 Photo-desorption 

The process of photo-desorption is potentially very important, but is unfor- 
tunately not well understood because of limited laboratory studies. Photo- 
desorption is a quantum process whereby absorption of a single photon can 
lead to ejection of an atom or molecule from a solid. For example, an absorbed 
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photon might cause an atom or molecule on a substrate to be excited to a 
repulsive electronic state which would then be ejected from the substrate. 

Photo-desorption was discussed by Draine & Salpeter (1979b), who ar- 
gued that a molecule in a surface monolayer might have a photo-desorption 
cross section as large as ^ 10“^® cm^. 

If an atom or molecule adsorbed on a grain surface has a cross section 
Cpd = 10“^®CT_i8Cm^ for 8 < hiy < 13.6 eV photons, the photo-desorption 
rate for an adsorbed atom or molecule would be ~ 6 x 10 “^^(Ti 8S“^ in the 
diffuse ISM, since the density of 8 — 13.6 eV photons is ~ 2.0 x 10“®cm“® 
in the MMP ISRF. Laboratory studies show that H 2 O ice has a photo- 
desorption cross section CTpd « 8 x 10~^® cm^ at A = 1215A (Westley et 
al. 1995a, 1995b), so that it seems likely that at least some species will have 
(j_i8 A. 1, with photo-desorption rate 6 x 10“^^ s“^. Note that this esti- 
mated photo-desorption rate is comparable to photo-dissociation rates for 
diatomic molecules in the interstellar radiation field. ^ 

O is the most abundant species after H and He. The probability per unit 
time that an O atom will arrive at a single surface site (of area ~ 10“^® cm^) 
is ~ 2 X 10“^® s“^ in the CNM. It is therefore apparent that any species with 
CTi8 A 0.01 will be photo-desorbed before accretion of anything chemically 
interesting (other than H) can take place on top of it. 

We conclude that photo-desorption most likely plays a dominant role in 
determining what kind of accretional growth takes place on a grain. It is no 
surprise that the noble gases (He, Ne, Ar) do not deplete, but the fact that 
some chemically-reactive elements (e.g., Na, K, S) appear to undergo minimal 
depletion, at least in the diffuse ISM, may be due to photo-desorption.® Lab- 
oratory studies of photo-desorption from carbonaceous or silicate substrates 
would be of great value to clarify why some elements deplete and others do 
not in the diffuse ISM. 

8.5 Grain Destruction in Shock Waves 

From time to time a shock wave will pass through an element of interstellar 
gas, compressing, heating, and accelerating the gas (Draine & McKee 1993), 
and creating conditions under which grain destruction can occur (Draine 
1995). Shock waves can be driven into cold cloud material ahead of an ion- 
ization/dissociation front (Bertoldi & Draine 1996), as the result of energetic 
outflows from newly-formed stars, or as the result of the explosion of a nearby 
supernova. 

It is easiest to study the shock in the “shock frame” where the shock front 
is stationary: in the upstream direction the grains and gas move together 

^ Roberge et al. (1991) estimate photo-dissociation rates 8.6 x 10~^° s“^ and 3.5 x 
10~^°s“^ for CH and OH, respectively. 

® The only other depletion-suppressing process would be chemisputtering by H 
atoms: an impinging H atom might react with the adsorbed atom, and the reac- 
tion energy might eject the resulting hydride. 
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toward the shock with a speed equal to the shock speed Vs- The fluid is then 
decelerated at the shock front, with the post-shock gas velocity Vg « Ug/4 for a 
strong shock. The post-shock gas is initially hot. If the shock is strong, and we 
can neglect the effects of a possible magnetic precursor, the gas temperature 
will rise to 



T. 
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= 2300 X K 
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where /i is the mass per free particle (/i/mn ~ 1. 4/1.1 = 1.27 for cold atomic 
clouds, and /i/mn = 1.4/0. 6 = 2.33 for molecular gas, where we assume 
He/H=0.1). 

The grains move across the shock front into the post-shock fluid, where 
they initially have velocity Vg — Vg ^ (3/4)us relative to the gas. The grains 
now begin to decelerate by gas drag, but are also acted on by the magnetic 
held B and electric held E = — Vg x B/c (evaluated in the shock frame). As 
a result, the grain population develops grain-grain velocity differences which 
can be of order the shock speed Vg, or even larger if conditions are favorable 
for “betatron” acceleration of the dust as the density and magnetic held in- 
crease in cooling post-shock gas (Spitzer 1976; Draine & Salpeter 1979b). 



8.5.1 Grain-Grain Collisions 

In the absence of betatron acceleration, a grain will slow down by a factor 
1/e when it has collided with its own mass of gas. If all grains were identical, 
then the chance that a grain would run into another grain before appreciable 
slowing-down would be of order the dust-to-gas mass ratio of ~ 0.01, and we 
can therefore expect that a typical grain (one representative of the bulk of 
the grain mass) will have a probability of ~ 0.01 of collision with another 
comparable grain with a grain-grain velocity difference ~ Vg. 

The physics of grain-grain collisions has been discussed by Tielens et al. 
(1994). For refractory grains one expects virtually complete destruction if 
the energy in the center-of-mass system is sufficient to vaporize both grains. 
For identical grains, mean atomic mass 20 amu, and binding energy per atom 
of 5.7 eV (these numbers are appropriate to MgFeSi04 silicate) complete 
vaporization could occur in a head-on collision between identical grains with 
a velocity difference of 15 km s“ ^ , so we may expect vaporization of perhaps of 
^ 1% of the grains in a hydrodynamic shock with shock speed Vg 20km s“^. 
At lower grain-grain collision speeds, shattering may occur. Thus we should 
expect that several percent of the large dust grains will be shattered in shocks 
with Vg ^ 10km s“^ . Grain-grain collisions may be responsible for maintaining 
the population of small grains in the interstellar medium. 
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Fig. 39. Sputtering yields for H and He on carbon estimated by Barlow (1978), 
Draine & Salpeter (1979a), and Tielens et al. (1994), and compared with laboratory 
data. 



8.5.2 Sputtering 

In a fast shock, the grain material can be eroded by the process of “sput- 
tering”, where individual atoms or ions from the gas collide with the grain, 
occasionally ejecting one of the grain atoms. In interstellar gas, the sputtering 
rate depends mainly on the sputtering “yields” Y (E) for H and He projec- 
tiles incident on silicate or carbonaceous target materials. In ionized gas the 
sputtering rates will be affected by the grain charge. An expression for the 
rate of sputtering of a moving grain, and estimates for sputtering yields Y, 
are given by Draine & Salpeter (1979a). 

In the initial post-shock region where the gas is hot, the sputtering is due 
to a combination of thermal energy and the motion of the grain through the 
gas with velocity ~ 3ug/4. If the gas cools before the grain slows down, then 
subsequent sputtering is entirely due to the rapid motion of the grain through 
the gas, with He atoms having kinetic energies 4 times larger than H atoms. 
The sputtering continues until the grain slows to the point where the kinetic 
energy of the impinging He atoms is too low for appreciable sputtering. 

Detailed models of grain destruction in shock waves are in general agree- 
ment that shock speeds Vs ^ 100km s“^ result in substantial grain destruc- 
tion, with essentially complete grain destruction for > 200km s“^ in the 
case of a radiative shock - see Fig. 41 (Draine & Salpeter 1979b; Jones et al. 
1994). 

These estimates for grain destruction appear to be in agreement with the 
observation that high-velocity gas (which has been shocked) tends to have 
enhanced gas phase abundances of elements, such as Si, which are normally 
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Fig. 40. Sputtering yields for H and He on silicate material estimated by Barlow 
(1978), Draine & Salpeter (1979a), and Tielens et al. (1994). 





Fig. 41. Fractional grain destruction calculated for graphite and silicate grains in 
interstellar shock waves in low density gas. Figure from Jones et al. (1994). 



heavily depleted in interstellar gas, an effect first noted for Ca by Routly & 
Spitzer (1952). Figure 42 shows this effect for Si. 



8.6 Thermal Sublimation 



Ice mantles on grains can be removed if the grains become too warm. What 
is the critical grain temperature for ice mantle removal? 

The probability per unit time for a surface molecule to desorb is approx- 
imately 



dP 

dt 






-Uo/kT 



(131) 
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Fig. 42. Observed gas-phase Si abundances: evidence for refractory grain destruc- 
tion in high-velocity shocks. Solid, dashed, and broken lines are theoretical models 
discussed by Cowie (1978). Figure from Cowie (1978). 



where Uq is the binding energy of the molecule, and vq « 10^^ is a char- 
acteristic vibration frequency. If there are iV^ono monolayers of ice on the 
grain and we want it to survive a time At, the grain temperature should not 
exceed 



Tsuh 



Uolk 

In ( t / A^m on o ) 



(132) 



Some heating phenomena are brief (e.g., a stellar flare or a supernova) but 
we are usually interested in longer time scales. For example, a massive star 
in or near the cloud might have a lifetime At « 5 x 10® yr. We can write 



Uo/Mk 

1 + 0.018 ln[(Z\t/10®yr)(103/fV„,ono)] 



(133) 



Thus, a H 2 O ice mantle {Uq = 0.5 eV) 10® monolayers thick will survive 
10® yr provided T < 108 K. Therefore we should not be surprised to find 
H 2 O ice in clouds where the grain temperature is 80 K (as in the molecular 
cloud OMC-1), but it would be a great surprise to find H 2 O ice if the grain 
temperature were, say, 120K. 



8.7 Gamma Ray Bursts 

Gamma ray bursts may destroy dust grains out to a substantial distance. 
While this is probably uninteresting in terms of evolution of the overall grain 
population in galaxies, it may nevertheless be of observational interests be- 
cause it affects the observable properties of gamma-ray bursts. This is of 
considerable interest because there is evidence that gamma-ray bursts are 
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associated with star-forming regions in galaxies, and yet the optical after- 
glows from a number of bursts show little evidence for reddening. 

Grain destruction near GRBs has been discussed by Waxman & Draine 
(2000), Fruchter, Krolik & Rhoads (2001), and Draine & Hao (2002). There 
are two separate mechanisms which could be important. Thermal sublimation 
(as discussed above, but now with larger binding energies Uo/k 7 x 10^ K 
and shorter times At « 10 s for the optical transient, so the grain temperature 
can rise to ~ 3000 K. Draine & Hao find that for a reasonable (but very 
uncertain) estimate of the GRB peak optical luminosity, dust grains can be 
destroyed out to distances of ~ 5 pc. 

A more exotic grain destruction mechanism may operate; if effective, it 
could destroy grains out to greater differences. This is the mechanism of 
“Goulomb explosion” (Waxman & Draine 2000). The grains will be quite 
highly charged due to the intense X-ray irradiation from the GRB. If the 
grains are sufficiently weak, the electrostatic stresses on the grain might con- 
ceivably tear it apart. Fruchter et al give arguments in favor of this outcome, 
in which case GRBs could destroy grains out to quite large distances. 

8.8 Time Scales 

When all grain destruction processes are considered, it is found that sput- 
tering in shock waves dominates the overall destruction of refractory grains. 
The frequency with which grains are overtaken by such shocks depends on 
the frequency of the supernovae which drive these blastwaves, and on the 
structure of the interstellar medium, since material in dense regions is “shel- 
tered” from the effects of a passing blastwave. The problem is complex, and 
therefore there are many places where different assumptions could alter the 
quantitative conclusion, but Draine & Salpeter (1979b) estimated a lifetime 
~ 1.3 X 10® yr for silicate material in a diffuse cloud, while Jones et al. (1994) 
find a lifetime 2.2 x 10® yr for silicate grains. 

Draine & Salpeter (1979b) stress that this relatively short lifetime was 
impossible to reconcile with the idea that “interstellar grains are stardust”: 
nearly all of the Si in the interstellar medium is in solid form (e.g., see Fig. 9, 
showing that only 2.5% of the Si is in the gas phase in the cloud toward ()Oph). 
The “turnover” time of the interstellar medium is equal to the ratio of (mass 
in gas)/(rate of mass going into stars), or ~ 5x lO®Af0/5M0 yr“^ « 1x10® yr. 
Suppose that all the Si leaving stars entered the ISM in solid form, but there 
was no grain growth in the ISM. A grain destruction time ~ 2 x 10®yr - 
only ^20% of the turnover time - would imply that ~ 80% of the Si would 
be found in the gas phase. However, observations of diffuse clouds typically 
show < 10% of the Si in the gas gas phase (see, e.g.. Fig. 9). 

The ineluctable conclusion is that most of the Si atoms in grains must 
have joined those grains in the ISM through some process of grain growth. 
The kinetics of accretion implies that only in denser regions can accretion 
rates be large enough to remove Si from the gas fast enough to maintain the 
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low observed gas-phase abundances. This in turn requires that there must 
be fairly rapid exchange of mass between the dense and less dense phases of 
the ISM. A more quantitative discussion of the required mass exchange rates 
may be found in Draine (1990) and Weingartner & Draine (1999). 



9 Effects of Dust on Interstellar Gas 

9.1 Photoelectric Heating 

Photoelectrons emitted from dust grains depart with nonzero kinetic energy. 
This is often the dominant mechanism for heating interstellar gas, whether 
in diffuse clouds or in photo-dissociation regions. 

Photoelectric heating rates have been re-estimated recently by Bakes & 
Tielens (1994) and Weingartner & Draine (2001c). For starlight with the spec- 
trum of the ISRF, Weingartner & Draine find a heating rate per H nucleon 
F/nn ~ 6x 10“^® ergs“^ for conditions characteristic of cool HI clouds. This 
heating rate is much larger than other heating mechanisms, such as ionization 
by cosmic rays or X-rays. 

9.2 H 2 Formation and Other Chemistry 

The H 2 molecule is central to interstellar chemistry. The rovibrational lines 
can be an important coolant for gas at temperatures 100 ^ T ^ 2000 K. 

The principal gas-phase formation processes are via H“ 

n + e- + hiy ; H--kH^H2-ke" (134) 

or 3-body collisions: 

R + }i + X^}i2 + X + AE (135) 

where the third body X could be H, He, or another H 2 . The H“ formation 
pathway (134) is slow because of the generally low abundance of H“ in diffuse 
gas, and the 3 body process (135) requires very high densities to become 
important. In the interstellar medium of the Milky Way, the dust abundance 
is such that H 2 formation is overwhelmingly due to grain surface catalysis. 
The grain catalysis process was first proposed by Gould & Salpeter (1963), 
and the basic picture has not changed: 

1 . An H atom arrives at a grain surface and “sticks” . 

2. The H atom diffuses over the grain surface until it becomes trapped by 
either chemisorption or physisorption. 

3. A second H atom arrives at the surface, sticks, diffuses over the surface, 
and encounters the trapped H atom. 

4. The two H atoms react to form H 2 , releasing ~ 4.5 eV of energy, and 
ejecting the H 2 from the grain surface into the gas. 
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Part of the 4.5 eV goes into overcoming the binding of the two H atoms to 
the surface, part goes into vibrational excitation of the grain lattice (“heat”), 
part appears as kinetic energy of the H2, and part appears in vibrational and 
rotational energy of the newly-formed H2 molecule. 

In the above picture, the H2 formation rate is 

= Rnnn{Il) (136) 
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f 1 dud 2 / ^ 
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(138) 



A = J da{l/nu){dnd/da)7ra^ (139) 

(7) = — y da (l/nH)(dud/da)7ra^7(a) (140) 

where 7(a) is the fraction of H atoms colliding with a grain of radius a which 
are converted to H2; 7 < 1 since not all impinging H atoms will “stick”, and 
some of those which stick may later be removed from the surface by some 
process other than H2 formation. Gould & Salpeter argued for 7 of order 
unity under interstellar conditions. 

Dust models which reproduce the observed extinction of starlight have 

A= [da—^7ra^>m-^\m^ (141) 

J nn da 

For example, the dust model of Weingartner & Draine (2001a) has A = 
6.7 X 10“^^ cm^/H. The H2 formation “rate coefficient” R can be determined 
from ultraviolet observations of H2 in diffuse regions; Jura (1975) found R « 
3 X 10“^^cm^s“^, which implies (7) « 0.06. It appears, then, that the very 
small grains which dominate the grain surface area (in particular, the PAHs) 
must not be efficient catalysts for H2 formation. 

How might H2 formation on the smallest grains be suppressed? The time 
scale between photon absorption events is 

Tabs«10®s(10A/a)3 , (142) 



while the time scale between arrival of H atoms is 

2 



th «8 x 106 s 7^ 



(")■ 



30 cm 
nn 



-3 



looKy/^ 



T 



(143) 



so it is possible that an H atom physisorbed on a very small grain may be 
removed when the grain is heated by a photon absorption event, so that 
only rarely would an arriving H atom find another H atom with which to 
recombine. The efficacy of this process depends on the binding energy of the 
H atom. 
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9.3 Grains as Sources of Complex Molecules 

Observational studies of star-forming regions sometimes find molecular abun- 
dances in the gas phase which are difficult to understand in the context of 
pure gas-phase chemistry. An extreme example of this is the observation 
of D 2 CO/H 2 CO = 0.003 toward Orion (Turner 1990), or D 2 CO/H 2 CO = 
0.03 — 0.16 toward the low-mass protostar IRAS 16293-2422 (Ceccarelli et 
al 2001). Such extreme deuteration seems impossible to envision in the gas 
phase, but could occur by chemistry on grain mantles during a precollapse 
phase, with the molecules put into the gas phase when the dust is warmed 
by energy from the protostar. There seems little doubt that at least some 
regions have molecular abundances which are heavily influenced by grain 
surface chemistry. 

9.4 lou Recombiuatiou ou Dust Graius 

In diffuse clouds, the ion/neutral fraction elements with ionization potentials 
< 13.6 eV is indicative of the relative rates for photoionization of starlight vs. 
recombination of ions with electrons. It is often assumed that the only channel 
for neutralization of metal ions X’*' is radiative recombination, X’*' -|- e“ — >■ 
X -|- hv. However, collisions with neutral or negatively charged grains can 
be more important than radiative recombination for neutralization of metal 
ions for conditions typical of the diffuse interstellar medium (Weingartner 
& Draine 2001d). Weingartner & Draine provide estimates for the effective 
“rate coefficient” for ion neutralization via collisions with dust grains. Re- 
combination on dust grains is particularly effective for protons in HI regions, 
and therefore is involved in regulating the free electron density. 

9.5 Coupliug Neutral Gas to Maguetic Fields 

In gas of low fractional ionization, a significant fraction of the “free” charge 
present may be located on dust grains. The fraction of the free charge residing 
on the grains depends very much on the numbers of very small dust grains, 
but appears likely to become significant in regions of low fractional ionization 
rielnu ^ 10“® (Draine & Sutin 1987). 

Coupling of the magnetic field to the gas is due to the force exerted on 
moving charges; in regions of low fractional ionization, the neutral gas will be 
unaffected by the magnetic field unless the charged species are collisionally 
coupled to the neutral atoms and molecules. Because dust grains have rela- 
tively large physical cross sections, the charge trapped on dust grains can be 
important at coupling the magnetic field to the neutral gas in MHD shocks in 
gas with fractional ionization rig/nH iS 10“^ (Draine 1980; Draine, Roberge 
& Dalgarno 1983). 
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9.6 Dust Grains as Magnetometers? 

Magnetic fields are dynamically important in the ISM, but difficult to observe 
remotely. Nature has been kind enough to strew microscopic magnetometers 
- dust grains - throughout the interstellar medium. Unfortunately we have 
not yet figured out how these magnetometers work, so that even if we could 
measure their degree of alignment, we would not be able to determine the 
magnetic field strength. But we can use dust grains as “compasses” to indicate 
the direction of the magnetic field. 

As discussed above, dust grains have their angular momenta J systemat- 
ically aligned with the magnetic field Bq, at least in regions where sufficient 
starlight is present. Here we review the main features of this alignment: 

• It is virtually certain that the sense of alignment is to have J || Bq (to 
within a sign - parallel and antiparallel are equally favored) . 

• Suppose the grain has 3 principal axes ai, a 2 , and a 3 , with moments of 
inertia /i > ?2 > ? 3 - If the grain has rotational kinetic energy Arot > 
(3/2)fcTgr, then the grain will tend to have its shortest axis ai || J so as 
to minimize its rotational kinetic energy at constant J. In regions where 
the grains are cooler than the gas (the most common situation), this is 
expected to be true for both large grains (which rotate suprathermally) 
and small grains (which rotate thermally). 

• Thus we expect the grains to have their “long axes” (a 2 , as) _L Bq. 

• Starlight extinction will be greatest for starlight with E || the long axes, 
and therefore we expect the transmitted starlight to have E || Bq. 

• Far-infrared and submm emission will be polarized || to the grains’ long 
axes, and hence we expect the FIR/submm emission to be polarized with 
E_L Bq. 

Therefore we can obtain a map of the magnetic field by either measuring 
the polarization of starlight for many stars, or by mapping the polarized 
far-infrared or submm emission from the cloud. CCDs and infrared detectors 
make possible extensive measurements of starlight polarization, even in dense 
regions, and FIR/submm arrays (e.g., SCUBA on the JCMT) make possible 
measurements of polarized submm emission. 

Grains are likely to have a fairly high polarizing efficiency in the 
FIR/submm. Define 

Cpoi ^ ^ [C(E II a 2 ) + C(E II ag)] - G(E || ai) (144) 

Gran = ^ [G(E II ai) + C(E || a2) + G(E || ag)] (145) 

If the grains were spherical, there would of course be no polarization in ex- 
tinction or emission. Figure 43 shows how the polarizing efficacy depends on 
the axial ratio of the dust grains. The grains are assumed to be either oblate 
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axial ratio = (intermediate + long)/(2xshort) 



Fig. 43. Polarization efficiency Cpoi/Cran for spheroidal dust grains (oblate and 
prolate) as a function of the axial ratio. From Padoan et al. (2001). 



or prolate spheroids, and the dielectric function of the grains is taken to be 
that of “astronomical silicate” at 350 /rm. 

For perfect spinning alignment, the maximum polarization is obtained for 
a sightline _L Bq: 



jj _ C 2 + C 3 — 2Ci _ Cpol/Cran 

J max — T 

C 2 + C 3 + 2 C 1 1 - i ^ 

D ^ran 

Suppose the grain shape has (intermediate+long/ (2x (short) = 1.4 (e.g., axial 
ratios 1 : 1.2 : 1.6). Then Cpoi/Cran ~ 0.5 for astronomical silicate. Perfect 
alignment would give polarization in emission P = 0.5/(l — 0.08) = 0.55. 
However, the largest observed polarizations are ~ 10%, implying either 

• imperfect alignment 

• less elongation 

• disorder in the magnetic field on short length scales. 

Unfortunately, it will be difficult to determine the relative importance of 
these three effects. For perfectly aligned grains in a uniform magnetic field 
perpendicular to the line-of-sight, P = 0.1 would be produced by grains with 
axial ratio of only ~ 1.05 - very minimal elongation. 





298 



B.T. Draine 



10 Concluding Remarks 

There are many unanswered questions in the astrophysics of interstellar dust 
grains in cold clouds. The questions are interesting in their own right, but 
one is also driven to answer them by the need to understand dust grains well 
enough to use them as diagnostics of interstellar conditions, and to under- 
stand the effects which dust grains have on the dynamics and evolution of 
interstellar gas. With the advent of powerful infrared and submillimeter ob- 
serving facilities - including SIRTF, SOFIA, ALMA, Herschel, and Planck 
- we can anticipate that there will be progress on the astrophysics of dust, 
driven by the challenge of understanding the new data. An exciting decade 
lies before us! 

I thank Robert Lupton for making available the SM software package. 
This work was supported in part by NSF grant AST 9988126. 
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From left to right, Bruce Draine, Andrew Blain and Frangoise Combes, almost 
done with the week of lectures, are still listening carefully to the audience last 

questions. 





Index 



2MASS, 120 
2dF, 3 

AAS, 59 

accommodation coefficient, 268 
ACS, 5 

Advanced Camera for Surveys, 5 
afterglow, 19, 292 

AGN, 8-10, 20, 27, 57, 63, 72, 154, 161, 
164 

ALMA, see Atacama Large Millimeter 
Array 

angular momentum, 49 
ASTRO-F, 92 

Atacama Large Millimeter Array 
(ALMA), 10, 11, 20, 32, 40, 66, 298 

background radiation, 23 
bar, 119, 154 
Barnett dissipation, 274 
Barnett effect, 273, 276 
Bell Labs survey, 115 
big-bang, 30 
BIMA, 58, 152 
black body, 163 
blackhole, 8, 9, 26 
BLAST, 92 
blazars, 82 
BOLOCAM, 69 
BOOMERanC, 30, 34, 57, 61 
brown dwarf, 199 

CBI, 30, 57, 59, 61 
CCD, 6, 20, 27, 296 
CDM, see cold dark matter 
CELT, 32 

Cfa-Harvard survey, 114 
Chandra, 3, 60, 85 
Clausius-Mossotti, 229 



CMB, see Cosmic Microwave Back- 
ground 

CNM, 264, 266, 276, 285 
coagulation, 286 

COBE, 17, 33, 56, 118, 191, 197, 222 

cold dark matter (CDM), 29, 146 

collapse, 43, 132 

Columbia-CFA, 114 

comoving, 37 

Compton, 90 

confusion noise, 5 

Copernicus satellite, 108 

Coriolis forces, 147 

Cosmic Microwave Background (CMB), 
2, 11, 29, 30, 33, 40, 46, 54 
cosmological constant, 190 
CSC, 79 
CUDSS, 66 
cyclotron period, 260 

Damped Lyman-a systems (DLA), 202 
DASI, 30, 57, 61 

Davis-Greenstein mechanism, 277, 279, 
280 

de Sitter, 36 
DEIMOS, 89 
density fluctuations, 195 
desorption, 192 

DIBs, see diffuse interstallar bands 

dielectric function, 228 

diffuse interstallar bands (DIBs), 218 

dipole approximation, 232 

DIRBE, 17, 118, 222, 245 

DLA, see Damped Lyman-a systems 

DMR, 56 

Drude profile, 217 

dwarf galaxy, 145, 150 

EBCM method, 231 




306 



Index 



Eddington limit, 200 
efficiency factors, 226 
EGRET, 124 
Einstein, 36 
Einstein-de Sitter, 43 
electric dipole, 105, 172 
Eley-Rideal process, 192 
EROs, see Extremely Red Objects 
ESE, see extreme scattering event 
Euler equation, 40 
extreme scattering event (ESE), 126 
Extremely Red Objects (EROs), 52, 63, 
184 

FCRAO, 149 

feedback, 9, 14, 21, 31, 44, 80, 137, 144, 
160 

fillin g factor, 110, 112, 129, 135, 150, 
164, 172, 177, 186 
FIRAS, 17, 222, 245 
FIRST, 5, 55 
flaring, 123 

forbidden velocities, 118 
Fourier, 45 

fractal, 125, 128, 134, 137 
fragmentation, 146 
free-free emission, 89 
Friedmann-Robertson- Walker (FRW) , 
34, 35, 38 

FRW, see Friedmann-Robertson- Walker 
fundamental plane, 27 
FUSE, 108 

G-dwarf problem, 124 
galactic shear, 126, 130, 147 
gamma-ray burst (GRB), 19, 33, 88, 
291 

gas drag, 257 
GET, 92 

general relativity (GR), 33, 35 
Giant Molecular Glouds (GMG), 126, 
137, 154, 202 

GMG, see Giant Molecular Glouds 
GMOS, 89 
GMRT, 3 

GR, see general relativity 
gravothermal catastrophe, 133 
GRB, see gamma-ray burst 
Greenbank Telescope, 92 
GRO, 124 



Gunn-Peterson effect, 65 

Ha, 85, 142, 154, 160 
Harrison-Zeldovich, 46 
Hausdorff dimension, 125, 127 
HDF, see Hubble Deep Field 
HEROs, 88 

Herschel Space Observatory, 5, 298 
heterodyne, 6 

High Velocity Clouds (HVC), 108, 123 
Highly ERO, 88 
horizon, 37 

HST, see Hubble Space Telescope 
Hubble, 33, 41, 47 
Hubble constant, 180 
Hubble Deep Field (HDF), 2, 5, 27, 55 
Hubble Space Telescope (HST), 4, 137, 
161 

HVC, see High Velocity Cloud 
ICM, 59 

IGM, see intergalactic medium 
IMF, see initial mass function 
inertial moment, 105 
inflation, 29 

Infrared Astronomy Satellite (IRAS), 

5, 13, 17, 21, 24, 27, 55, 156, 183, 222 
Infrared Space Observatory (ISO), 5, 
17, 28, 54, 55 

initial mass function (IMF), 26, 134, 
199 

intergalactic medium (IGM), 18, 19, 

29, 32, 40, 48, 59, 64, 65, 201 
International UV Explorer, 65 
interstellar radiation field (ISRF), 4 
intracluster medium , 59 
IRAM, 10, 158, 179, 183 
IRAS, see Infrared Astronomy Satellite, 
128 

IRTS, 222, 246 
ISFR, 90 
ISM, 7, 8 

ISO, see Infrared Space Observatory, 
see Infrared Space Observatory 
ISRF, see interstellar radiation field, 
244 

James Webb (Next Generation) Space 
Telescope (JWST), 11, 32, 67 
Jansky, 22 




Index 



307 



JCMT, 17, 183, 296 
Jeans analysis, 40 

Jeans length, 128, 133, 138, 162, 193, 
197 

Jeans mass, 193, 197, 198, 201 
JHK, 281 

JWST, see James Webb (Next 
Generation) Space Telescope, 79 

K-correction, 51, 82 
Keck-NIRC, 66 
Kolmogorov cascade, 130 
Kramers- Kroning relations, 235 

Langmuir-Hinshelwood process, 192 
Large Magellanic Cloud (LMC), 108, 
168, 283 

Large Velocity Gradient (LVG), 166 

LBGs, see Lyman-break galaxies 

linear dichroism, 273 

LMC, see Large Magellanic Cloud, 224 

logatropic equation, 132 

Lorentz force, 260 

Low Surface Brightness Galaxy (LSB), 
145, 150, 158 
LRIS, 80 

LSB, see Low Surface Brightness 
Galaxy 

luminosity function, 21, 27 
LVG, see Large Velocity Gradient, 186 
Lyman-a, 18-20, 52, 65, 89, 176, 193, 
197 

Lyman-a forest, 175, 201 
Lyman-break galaxies (LBGs), 29, 62 

Mach number, 266 
MACHOs, 199, 201 
Magellanic Clouds, 111 
Magorrian relation, 9, 26, 81 
MAMBO, 62, 65, 183 
MAXIMA, 57 
MERLIN, 3 
metallicity, 149 
MHD, 286, 295 
Mie theory, 228, 231, 241, 263 
Millimeter Array (MMA), 10 
MIPS, 18 

MMA, see Millimeter Array 
molecular cloud, 120 
molecular ring, 117, 118 



Muler matrix element, 226 

N-body, 31, 80, 147 
neutron star, 26 
NGST, 201 

NRAO-Kitt Peak survey, 115 
nucleosynthesis, 30 

OB star, 8, 144 

opacity, 10, 14, 66, 82, 89, 186, 191, 
225, 227, 235 
Ostriker-Vishniac, 60 
overdensity, 40, 43, 45 
OVRO, 10, 56, 58 

PAH, see polycyclic aromatic hydrocar- 
bon 

paramagnetic dissipation, 277 
Pauli exclusion principle, 105 
PdBI, see Plateau de Bure 
photo-desorption, 262, 265, 286 
photo-ionization, 106 
photoelectric yield, 253 
photolysis, 286 
Plank Surveyor, 59, 61, 298 
Plateau de Bure (PdBI), 10 
Poisson, 40 

Polar Ring Galaxies (PRG), 155 
polarization, 215, 273, 296 
polycyclic aromatic hydrocarbon 
(PAH), 73, 217, 220, 223 
population III, 198 
power spectrum, 33, 47 
Press-Schechter, 45, 47, 190 
PRG, see Polar Ring Galaxies 

QSO, see quasar 
quantum fluctuation, 29 
quasar (QSO), 8, 14, 18, 20, 27, 51, 
160, 170, 175 
quintessence, 36 

radiation pressure, 261, 271 
radiative torque, 270, 280, 282 
radiative transfer, 10 
Rayleigh limit, 224 
Rayleigh-Gans, 240 
Rayleigh- Jeans, 15, 53, 112 
recombination, 30 
redshift, 77 
Rees-Sciama, 60 
Reynolds number, 129 




308 



Index 



ROSAT, 241 
Rowland effect, 273, 275 
SAFIR, 5, 32 

Safronov- Toomre criterion, 139 
Saskatoon, 56 

scattering, 11, 215, 226, 231 
Schechter function, 21 
Schechter luminosity, 42 
Schmidt law, 142 

SCUBA, 17, 62, 65, 67, 79, 84, 183, 296 
SDSS, 3 

SED, see spectral energy distribution 
Serkowski law, 215, 216 
SEST, 59 

Seyfert galaxy, 160 
SHARC-II, 69, 79 
Sheth-Tormen, 48 
SINGS, 18 

SIRTF, see Space Infra-Red Telescope 
Facility 
SKA, 94 

SMA, see SubMillimeter Array 
Small Magellanic Cloud (SMC), 283 
Small Magellanic Cloud SMC), 247 
SMC, 224, see Small Magellanic Cloud 
SOFIA, 298 

Space Infra-Red Telescope Facility 
(SIRTF), 5, 18, 54, 245, 298 
SPECS, 32 

spectral energy distribution (SED), 10, 
12, 23, 51, 63, 70, 161 
SPH, 134 
SPICA, 5 
SPIRIT, 94 
sputtering, 289 

star formation, 8, 14, 21, 142, 160, 182, 
201 

starburst, 139, 144, 183 
sticking coefficients, 251 
stochastic heating, 242 
Stokes vector, 227 
streaming motion, 115 
Subaru Deep Field, 63 
SubMillimeter Array (SMA), 9 
Sunyaev-Zeldovich (SZ), 57, 93 
supernova, 19, 31, 33, 81, 88, 144 
supernova remnant, 90 
suprathermal roation, 274 
suprathermal rotation, 278, 283 



surface density, 21, 64, 70, 72, 111, 121, 
123, 138, 141, 142, 151, 201 
SuZIE, 56 
synchrotron, 90 
SZ, see Sunyaev-Zeldovich 

T-matrix method, 231 
thermal flipping, 265, 279, 282 
thermal sublimation, 290 
tidal torque, 50 

tiny scale atomic structures (TSAS), 
126 

Toomre criterion, 121 

TSAS, see tiny scale atomic structures 

Tully-Fisher, 151 

turbulence, 129, 131 

UIE, 65 
UK-ATC, 93 

ULIRGs, see Ultra Luminous Infra Red 
Galaxies 

Ultra Luminous Infra Red Galaxies 
(ULIRGs), 107, 139, 159, 162, 164, 
166, 169 

UV lines, 108, 109 
UV photons, 105 
UV radiation, 107, 123 

valence band, 253 
VIMOS, 89 
VLA, 3, 55 
VLB A, 178 
VSA, 30, 57, 61 

warp, 121, 123 

WFPC-II, see Wide Eield and 
Planetary Camera 
white dwarf, 26 

Wide Field and Planetary Camera 
(WFPC-II), 5 

Wilkinson Microwave Anisotropy Probe 
(WMAP), 30, 46, 57, 61 
WMAP, see Wilkinson Microwave 
Anisotropy Probe 
WNM, 266 

X-ray, 91 
x-ray, 240 
XMM, 3 

Zeldovich pancakes, 29 
zodiacal light, 51 




